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^ (54) Title: METHOD AND SYSTEM FOR PREDICTING NUCLEIC ACID HYBRIDIZATION THERMODYNAMICS AND 
^ COMPUTER-READABLE STORAGE MEDIUM FOR USE THEREIN 

^2 (57) Abstract: Method and system to predict and optimize probe-target hybridization are provided. The method may be imple- 
^ mented using six interactive, interrelated, software modules. Module 1 predicts the hybridization thermodynamics of a duplex given 
the two strands. Module 2 finds the best primer of a given length binding to a given target. Module 3 executes a primer walk to find 
^ alternative binding sites of a given primer on a given target. Module 5 is a combination of Modules 2 and 3. Module 6 finds the 
^ alternative binding sites of a given primer on a given target (Module 3) and calculates the concentration of target with primer bound 
at primary and alternative sites. Module 7 is a combination of Modules 2 and 5 and also calculates the various concentrations. The 
six modules can be operated either through an interactive user interface or using batch file submission as provided by Module 4. The 
program is suited to predict DNA/DNA, RNA/RNA, and RNA/DNA systems. 
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METHOD AND SYSTEM FOR PREDICTING NUCLEIC 
ACID HYBRIDIZATION THERMODYNAMICS AND 
COMPUTER-READABLE STORAGE MEDIUM FOR USE THEREIN 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention 

This invention relates to methods and systems for predicting nucleic 
acid hybridization thermodynamics and computer-readable storage medium for use 
therein. 

2. Background Art 

10 Improvement of the efficiency of hybridization-based techniques 

requires the optimization of the binding between two sequences. Accurate 
prediction of the thermodynamics allows optimal choice of the sequences, 
temperature, and salt conditions. Hence, the prediction of nucleic acid 
thermodynamics is important to optimize techniques like PCR (Saiki et al., 1988), 

15 Southern and Northern blotting (Southern, 1975), antigene targeting (Freier, 1993), 
and Kunkel site-directed mutagenesis (Kunkel et al., 1987). 
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Hybridization prediction is also important for designing DNA 
microchips that have a wide field of application ranging from diagnostics (Hacia, 
1999; Yershov et al., 1996) to gene expression analysis (Ferea et al., 1999) and 
drug discovery (Debouk and Goodfellow, 1999). Microchips contain a large 
5 number of DNA probe sequences that have to be designed to specifically hybridize 
target sequences in a pool of DNA fragments. First, a DNA probe should be 
designed to bind to only one site of only one DNA target. Second, the different 
DNA probe sequences need to hybridize to their targets under the same temperature 
and solution conditions. Moreover, in sequencing by hybridization (Fodor et al., 
10 1993; Mirzabekov, 1994) where microchips are used to determine the sequence of 
given DNA, one has to be able to know hybridization thermodynamics to 
discriminate signals resulting from perfectly matched and mismatched probe/target 
hybridizations. 

Another widely used technique that requires hybridization prediction 
15 is the fluorescence in situ hybridization (FISH) technique (Gall and Pardue, 1969). 
In this technique, a fluorescent tagged nucleic acid probe is designed to specifically 
hybridize cellular or tissue section nucleic acids. The target of these probes can 
either be endogenous DNA, messenger RNA or viral and bacterial sequences. 

Therefore, FISH is used to monitor gene expression (McNicol and 
20 Farquharson, 1997), detect infectious agents (Bashir et al., 1994; McNicol and 
Farquharson, 1997; Pollanen et al., 1993), study cell cycle (McNicol and 
Farquharson, 1997), map chromosomes and study nuclear architecture (Heng et al., 
1997). It was also determined that a set of probes can be used simultaneously 
(multiFISH) to detect different loci (Pagon, 1997). Once again, prediction of 
25 hybridization is essential to insure specificity. Nucleic-acid hybridization prediction 
is also important for the design of oligonucleotide aptamers or antisense 
oligonucleotides (Cohen, 1992) that can be used for various therapeutic applications. 
A new type of probes known as molecular beacons (Bonnet et al., 1999; Tyagi et 
al. 1998) that are very specific has been developed and shown to be efficient for 
30 mutation analysis (Giensendorf et al., 1998) and multiplex detection of single 
nucleotide variations (Marras et al., 1999). The design and prediction of the 
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thermodynamics of these beacons is helped by hybridization thermodynamics 
prediction (Bonnet et al., 1999). Accurate prediction of hybridization is also 
important for the practical realization of DNA-based or more generally nucleic acid- 
based computers. (Adleman, L.M., 1994). 

5 The development of molecular biology techniques based on 

hybridization (PCR, FISH, DNA microchips, etc.) has resulted in a need for 
efficient automated ways to design probes and primers. In the last decade, 
numerous algorithms have been developed to optimize the design of primers and 
probes for various applications (Rychlik and Rhoads, 1989)(Breslauer et al., 1986; 

10 Chen and Zhu, 1997; Dopazo et al., 1993; Haas et al., 1998; Hillier and Green, 
1991;Hyndmanetal., 1996; Lietal., 1997; LinketaL, 1997; Pesoleetal., 1998; 
Proutski and Holmes, 1996). Numerous unpublished software to predict primers 
are also made available by research groups and biotech companies on the World 
Wide Web (Primer3 from the Whitehead Institute for Biomedical Research. Primer 

15 Express™ from PE Biosystems, DNAstar from IDT, etc.) 

There are currently many software packages on the market for DNA 
primer design including: OLIGO, PRIMER PREMIER, OSP, GCG, PrimerMaster, 
and Primo. None of the current programs, however, were written by experts in 
DNA thermodynamics; thus, there are many improvements that can be made. 

20 Nearly all of the current software packages contain mistakes that result from a lack 
of understanding of the underlying theory of DNA hybridization. PCR is a fairly 
robust process and thus even crude programs make predictions that work 90-95% 
of the time. Multiplex PCR primer design, however, is not at all trivial and 
detailed knowledge of the physical chemistry of DNA hybridization, and the 

25 availability of an accurate thermodynamic database are essential to reliable design 
of multiplex PCR primers. In multiplex PCR, several primers must be designed to 
specifically bind to different sites on target DNA at a given temperature with 
minimal background binding to mismatch sites and with minimal cross- 
hybridizations between pairs of primers. The design of molecular beacons for DNA 

30 oligonucleotide arrays is also very challenging because of the complex competing 
equilibria. 
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Most of the existing programs aiming at finding an optimum probe 
that binds a specific location on a target, however, do not include accurate stability 
rules for hybridization and neglect or poorly approximate competitive binding sites, 
strand folding and strand dimerization. 

5 U.S. Patent Nos. 5,593,834 and 6,027,884 to Lane et al. disclose 

methods to design and construct DNA sequences with selected reaction attributes. 

In summary, prediction of nucleic acids thermodynamics is important 
to optimize various molecular biology techniques including multiplex PCR, DNA 
microchips, molecular beacons, and fluorescence in situ hybridization. Most of the 
10 available programs for probe design do not include a complete parameterization and 
often do not account for mismatches. Moreover, single strand folding is not taken 
into account, which often leads to inaccurate predictions. 

SUMMARY OF THE INVENTION 

An object of the invention is to provide a method and system for 
15 predicting nucleic acid hybridization thermodynamics and computer-readable storage 
medium for use therein wherein the invention utilizes a thermodynamically rigorous 
approach to evaluate the quality of probes and simulate probe/target hybridization. 

Another object of the invention is to provide a method and system for 
predicting nucleic acid hybridization thermodynamics and computer-readable storage 
20 medium for use therein wherein the invention also takes into account single strand 
folding thermodynamics to calculate effective hybridization thermodynamics. 

In carrying out the above objects and other objects of the present 
invention, a method for predicting nucleic acid hybridization thermodynamics is 
provided. The method includes providing a database of thermodynamic parameters, 
25 receiving hybridization information which represents at least one sequence, 
receiving correction data, receiving a first set of data which represents hybridization 
conditions, and calculating hybridization thermodynamics including net 
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hybridization thermodynamics based on the hybridization information, the 
thermodynamic parameters, the correction data and the first set of data. 

The hybridization thermodynamics of individual single stranded, 
bimolecular and higher order complexes may be statistically weighted in a numerical 
5 process and the equilibrium concentration of each species is output. 

The correction data may include folding correction data and/or linear 
correction data. 

The thermodynamic parameters may include DNA thermodynamic 

parameters. 

10 The DNA thermodynamic parameters may include dangling end 

parameters and/or coaxial stacking parameters. 

The DNA thermodynamic parameters may further include terminal 
mismatch parameters. 

The thermodynamic parameters may include RNA thermodynamic 
15 parameters and/or hybrid DNA/RNA thermodynamic parameters. 

The thermodynamic parameters may further include DNA loop 
thermodynamic parameters. 

The hybridization information may represent top and bottom strand 
sequences which form a duplex and wherein the hybridization thermodynamics are 
20 calculated for the duplex. 

The hybridization information may further represent at least a section 
of a target and a length of at least one primer or probe complimentary to the target. 
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The hybridization thermodynamics may be calculated for a plurality 
of primers or probes complimentary to the target. 

The hybridization information may represents at least a section of a 
target and a primer or probe. 

5 A length of the target may be longer than a length of the primer or 

probe and wherein the hybridization thermodynamics are calculated for a best 
target/primer or target/probe complex and for competitive mismatch complexes. 

Hybridization information may represent at least a section of a target 
and a primer or probe and wherein a length of a target is longer than the length of 
10 the primer or probe and wherein the hybridization thermodynamics are calculated 
for a best target/primer or target/probe complex and for competitive target/primer 
or target/probe complexes. 

The method may further include calculating concentration of each 
species in a solution at a plurality of temperatures. 

15 Hybridization information may also represent a primer or probe and 

wherein the length of the target is longer than a length of the primer or probe and 
wherein the hybridization thermodynamics are calculated for a best target/primer 
or target/probe complex and for competitive mismatch complexes and wherein the 
method may further comprise calculating concentration of every species in a 

20 solution at a plurality of temperatures. 

The hybridization thermodynamics may be calculated for at least two 
best target/primer or target/probe complexes and for their corresponding 
competitive mismatch complexes and wherein the method may further comprise 
correcting for any interactions between the at least two best target/primer or 
25 target/probe complexes and their components. 
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Further in carrying out the above objects and other objects of the 
present invention, a system for predicting nucleic acid hybridization 
thermodynamics is provided. The system includes a database of thermodynamics 
parameters, means for receiving hybridization information which represents at least 
5 one sequence, and means for receiving correction data. The system further includes 
receiving a first set of data which represents hybridization conditions, and means 
for calculating hybridization thermodynamics including net hybridization 
thermodynamics based on the hybridization information, the thermodynamic 
parameters, the correction data and the first set of data. 

10 The hybridization thermodynamics of individual single stranded, 

bimolecular and higher order complexes may be statistically weighted in a numerical 
process and the equilibrium concentration of each species is output. 

The correction data may include folding correction data and/or linear 
correction data. 

15 The thermodynamic parameters may include DNA thermodynamic 

parameters such as dangling end parameters. 

The DNA thermodynamic parameters may include coaxial stacking 
parameters and/or terminal mismatch parameters. 

The thermodynamic parameters may include RNA thermodynamic 
20 parameters and/or hybrid DNA/RNA thermodynamic parameters. 

The thermodynamic parameters may further include DNA loop 
thermodynamic parameters. 

The hybridization information may represent top and bottom strand 
sequences which form a duplex and wherein the hybridization thermodynamics are 
25 calculated for the duplex. 
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The hybridization information may also represent at least a section 
of a target and a length of at least one primer or probe complimentary to the target. 

The hybridization thermodynamics may be calculated for a plurality 
of primers or probes complimentary to the target. 

5 The hybridization information may represent at least a section of a 

target and a primer or probe. 

A length of the target may be longer than a length of the primer or 
probe and wherein the hybridization thermodynamics are calculated for a best 
target/primer or target/probe complex and for competitive mismatch complexes. 

10 Hybridization information may represent at least a section of a target 

and a primer or probe and wherein a length of a target is longer than the length of 
the primer or probe and wherein the hybridization thermodynamics are calculated 
for a best target/primer or target/probe complex and for competitive target/primer 
or target/probe complexes. 

15 The system may further include means for calculating concentration 

of each species in a solution at a plurality of temperatures. 

Hybridization information may also represent a primer or probe and 
wherein the length of the target is longer than a length of the primer or probe and 
wherein the hybridization thermodynamics are calculated for a best target/primer 
20 or target/probe complex and for competitive mismatch complexes and wherein the 
system may further comprise means for calculating concentration of every species 
in a solution at a plurality of temperatures. 

The hybridization thermodynamics may be calculated for at least two 
best target/primer or target/probe complexes and- for their corresponding 
25 competitive mismatch complexes and wherein the system may further comprise 
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means for correcting for any interactions between the at least two best target/primer 
or target/probe complexes and their components. 

Still further in carrying out the above objects and other objects of the 
present invention, a computer-readable storage medium having stored therein a 
database of thermodynamics parameters and a computer program are provided. The 
computer program executes the steps of: a) receiving hybridization information 
which represents at least one sequence; b) receiving correction data; c) receiving a 
first set of data which represents hybridization conditions; and d) calculating 
hybridization thermodynamics based including net hybridization thermodynamics 
based on the hybridization information, the thermodynamic parameters, the 
correction data and the first set of data. 

The hybridization thermodynamics of individual single stranded, 
bimolecular and higher order complexes may be statistically weighted in a numerical 
process and the equilibrium concentration of each species is output. 

15 , The correction data may include folding correction data and/or linear 

correction data. 

The thermodynamic parameters may include DNA thermodynamic 

parameters. 

The DNA thermodynamic parameters may include dangling end 
20 parameters and/or coaxial stacking parameters. 

The DNA thermodynamic parameters may further include terminal 
mismatch parameters. 

The thermodynamic parameters may include RNA thermodynamic 
parameters and/or hybrid DNA/RNA thermodynamic parameters. 



5 



10 
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The thermodynamic parameters may further include DNA loop 
thermodynamic parameters. 

The hybridization information may represent top and bottom strand 
sequences which form a duplex and wherein the hybridization thermodynamics are 
calculated for the duplex. 

The hybridization information may represent at least a section of a 
target and a length of at least one primer or probe complimentary to the target. 

The hybridization thermodynamics may be calculated for a plurality 
of primers or probes complimentary to the target. 

The hybridization information may represent at least a section of a 
target and a primer or probe. 

A length of the target may be longer than a length of the primer or 
probe and wherein the hybridization thermodynamics are calculated for a best 
target/primer or target/probe complex and for competitive mismatch complexes. 

Hybridization information may represent at least a section of a target 
and a primer or probe and wherein a length of a target is longer than the length of 
the primer or probe and wherein the hybridization thermodynamics are calculated 
for a best target/primer or target/probe complex and for competitive target/primer 
or target/probe complexes. 

The program may further execute the step of calculating 
concentration of each species in a solution at a plurality of temperatures. 

Hybridization information may also represent a primer or probe and 
wherein the length of the target is longer than a length of the primer or probe and 
wherein the hybridization thermodynamics are calculated for a best target/primer 
or target/probe complex and for competitive mismatch complexes and wherein the 
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program may execute the step of calculating concentration of every species in a 
solution at a plurality of temperatures. 

The hybridization thermodynamics may be calculated for at least two 
best target/primer or target/probe complexes and for their corresponding 
competitive mismatch complexes and wherein the program may execute the step of 
correcting for any interactions between the at least two best target/primer or 
target/probe complexes and their components. 

The above objects and other objects, features, and advantages of the 
present invention are readily apparent from the following detailed description of the 
best mode for carrying out the invention when taken in connection with the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a schematic drawing wherein multiple equilibria are 
considered for concentration calculations; 

FIGURE 2a is a schematic drawing of a user input interface wherein 
the user provides various input information for a first module of the invention; 

FIGURE 2b is a schematic drawing of a user output interface wherein 
a computer provides output information corresponding to the input information of 
Figure 2a; 

FIGURE 3a is a schematic drawing of a user input interface wherein 
the user provides various input information for a second module of the invention; 

FIGURE 3b is a schematic drawing of a user output interface wherein 
a computer provides output information corresponding to the input information of 
Figure 3a; 
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FIGURE 4a is a schematic drawing of a user input interface wherein 
the user provides various input information for a third module of the invention; 

FIGURE 4b is a schematic drawing of a user output interface wherein 
a computer provides output information corresponding to the input information of 
5 Figure 4a; 

FIGURE 5a is a schematic drawing of a user input interface wherein 
the user provides various input information for a fifth module of the invention; 

FIGURE 5b is a schematic drawing of a user output interface wherein 
a computer provides output information corresponding to the input information of 
10 Figure 5a; 

FIGURE 6 is a block diagram flow chart illustrating the solution of 
conservation equations of the present invention; 

FIGURE 7 is a schematic diagram illustrating multiplex PGR design; 

FIGURE 8 shows prediction of molecular beacon net hybridization 
1 5 thermodynamics ; 

FIGURE 9 shows simulation of molecular beacon hybridization 
concentrations at temperatures from 0 to 100°C; 

FIGURE 10 is a diagram of match vs. mismatch hybridization; 

FIGURE 11 shows match vs. mismatch hybridization simulation at 
20 different temperatures; 

FIGURE 12 shows a general case of competitive hybridization 
equilibria that can be solved using the described numerical methods; and 
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FIGURE 13 is an example of simultaneous equations for the general 
five molecule case. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In general, the method and system of the present invention include 
5 rigorous thermodynamic parameterization for Watson-Crick base pairs, internal 
mismatches, terminal mismatches, terminal dangling ends, co-axial stacking 
interactions, sodium and magnesium salt dependence, denaturants (urea, formamide, 
DMSO). In addition, loop parameters for hairpins, internal loops, bulges, and 
multibranched loops are included. For DNA essentially all the parameters have 

10 been previously published or all included in the Appendix hereto. Specifically, the 
parameters which have been published include Watson-Crick parameters, sodium 
dependence, GT, GA, CT, AC, AA, CC, GG, and TT mismatches. The 
parameters included herein include dangling ends, terminal mismatches, DNA loop 
parameters, and co-axial stacking parameters. For RNA, the parameters have been 

15 published by Douglas H. Turner et al. For DNA/RNA hybrid duplexes, the 
parameters have been published by Naoki Sugimoto. 

The method and system are adapted for future implementation of 
parameters for modified nucleosides (including but not limited to inosine, 5- 
nitroindole, PNA, MOE-modified RNA, and iso-bases). With these parameters, it 

20 is possible to predict the melting temperature, Tm, of a duplex within 2°C on 
average. Correction for surface effects for DNA chip arrays is also implemented. 
In addition to predicting duplex hybridization, the software accounts for single- 
strand secondary structure. This is accomplished by a new numerical procedure for 
solving complex coupled equilibria (multi-state model). With this approach, it is 

25 possible to accurately predict not only the Tm for hybridization but also the 
concentration of every species in the solution (e.g. match duplex, mismatch, duplex, 
folded target, folded primer, primer dimer, etc.) at every temperature from 0 to 
100°C. Thus, it is possible to use this software to design oligonucleotide 
hybridization with optimized temperature, salt, and strand concentrations. 
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Predicting Accurately Primer Target Interaction Stability 

The stability of a primer/target or probe/target complex can be 
described by the free energy of association of the probe and the target. The most 
accurate way to calculate free energy of association is to use the nearest-neighbor 
5 model with accurate thermodynamic parameters. Thermodynamic parameters 
should account for Watson-Crick base pairs (SantaLucia et al., 1996; Allawi & 
SantaLucia, 1997; SantaLucia, 1998), single mismatches (Allawi & SantaLucia, 
1997), terminal mismatches (disclosed herein), dangling ends (Bommarito, Pugret 
& SantaLucia, 2000) and possibly double mismatches. Proper calculation of the 

10 monovalent and divalent salt dependence is also important (SantaLucia, 1998). 
Other loop motifs for hairpins, bulge, internal loops and multi-branched loops are 
important for single strand secondary structure prediction, but are often very 
crudely approximated. Moreover, when primer and target folding can occur, a set 
of coupled equilibria should be used to model the system. The nearest-neighbor 

15 model needs to be used to determine the equilibrium constant of each equilibrium. 
The determination of possible primer or target folding can be addressed by using 
secondary-structure prediction algorithms like M. Zuker's MFOLD (Zuker, 1989). 

Secondary Structure and Net Hybridization Thermodynamics 

Species Concentration Calculations 

20 Consider a system of strands SI and S2 with four states: folded 

target, folded probe, probe bound to target, and random coil target and probe. The 
model can be described by three equilibria as shown in Figure 1. 

The concentrations of every species for such a system can be 
analytically determined. The three equilibrium constants for such a system are 
25 shown below: 
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si=hi (2 ) 



82=112 *-m <3! 



where SI, S2, HI, H2, and DH are the random coil SI, the random coil S2, the 
folded strand HI, the folded strand H2, and the double helix DH, respectively. The 
5 conservation of SI and S2 leads to the following equations: 

C™ = Sl+ Hl+ DH (4) 

Cjf = S2+ H2 + DH (5) 

where Csf 1 are the total concentrations of SI and S2. [DH] and [S2] can be 
10 expressed as a function of [SI] by substituting the [HI] obtained from Equation 2, 
in Equation 6, and then substituting the [DH] obtained by Equation 6 in Equation 
1. 

[Di/]=Cjr'-[Sl]-# 2 [Sl] (6) 



15 Substitution of [H2], [DH], and [S2] from Equations 3, 6 and 7 in 

Equation 5 leads to an expression of Kj that can be rearranged as a quadratic 
equation in [SI]: 

[SI] 2 (K, + K2 K,) + [S1](K, + K 3 + K 2 K 3 - K, C SI + Kj + 1) - (K 3 + 1) C SI = 0 

(8) 

20 This equation is simplified by making the following substitutions: 
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a = (K,+K 2 K,) (9) 
b = (Kj C™ + K 3 + K 2 K 3 - K, C™ + K 2 + 1) (10) 

c = (k 3 + i) c™ (H) 

The physical solution of the quadratic equation (i.e. positive root) is (Press, 1999): 

mm -t*Jtt (12) 



or 



The second equation has better numerical stability (Press, 1999). 
[DH], [S2], [HI], and [H2] can then be calculated using Equations 1-3. 
10 Determination of Net Free Energy 

The net free energy of hybridization is calculated as follows: 



*G> 31ml =-RT\nK„, (14) 



where 



k [DH] 

single stranded ] jl^gto 



15 where [SI single stranded] and [S2 single stranded] are the concentrations of SI and 
S2 either in the random coil state or the hairpin states, at the temperature of the 
simulation. Using the conservation of SI and S2, Equation 14 is rewritten as 
follows: 

-16- 
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[DH] 

\ClT'-[DH])i.ClT'-[DH]) 



Note that AG° r m has the unusual property that it depends on the total strand 
concentrations, dsj* 1 and Csf 1 . The net free energy expresses the duplex formation 
equilibrium free energy corrected for secondary-structure formation in the single 
5 strands. 

Determination of Net Meltins Temperature 

If the strands are non self-complementary two cases have to be 
considered depending on the relative strand concentrations: 

1) If SI is the limiting reagent (C%* [ < C™)> at T M : 



10 [DH\^^ClT l (17) 

The concentrations of strands [SI] and [S2] are given by the following relations: 



Cl?* = \c T s r al + K 2 [Sl] + [SI] (18a) 



s-i Total 

CT =2(A?7i) <18b) 



C£" • jC" 1 + K,[S2)* [S2] (19a) 



ft Total _ ^s-iTotal 

15 rs2i= ~ 2 " d9b) 
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The replacement of [SI] and [S2] in Equation 1 gives: 



(20a) 

|OlJ|OZJ rTot al _ ± r Total 

^52 o S] 



n 1 /-iToial /-i Total . 

K 2 + K 2 K 3 -Kg^+j (20b) 
2 ~ ^ 



Using the relation AG° T = -R T In K, Equation 20 is arranged as follows: 



r (2) -AG' r (2)-AG» r (3) -AG' r (3) ^ AG* r Q) 



(21) 

AG° T can then be decomposed as AG° T =AH°- T AS° (assuming AC p =0 ) to obtain: 
0=^C™ + C™ + e RT e R 

-A//«(2)-A//°(3)+A//»(l) A5 0 (2)4-A5°(3)-AS o (l) -A/y(3)+AJy(l) AS°(3)-A5°(1) A//°(1) -AS fl (0 

+ 6 * r e 72 +e RT e R + e RT e R 

(22) 

10 The above equation can be solved by bisection or other numerical techniques to find 
T. This solution is the net melting temperature. 

2) If S2 is the limiting reagent (C^ 1 < Cg*), the following relation can 
be deduced by a similar approach: 



0=\c™-C™+ K3 + K3 ^ K2 + l (23) 

15 Again, application of the bisection method to an equation symmetric to Equation 22 
affords the net melting temperature. 
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If the strand S is self complementary, the reactions are described by 
the following equilibria: 



S + S-DH *i = ^T (24) 

IP J 



S-H K 2 = 1 ^ (25) 



AtT M :[DH]=^C?' a ' (26) 



The strand conservation equation is: 



Cs°"" = [S] + [#] + 2[DH] (27) 



Insertion of [H] and [DH] from Equations 25 and 26 in Equation 27 leads to: 



C™ = 2(A, + o [5] = 2 ^ + l) (28) 



10 Introduction of [S] in Equation 24 gives: 



v [£*H (^2 +1+ 2.g 2 ) , - r<Me , ,- Q . 

TsF = — — ~ 



Using the relation AG° T = -R T In K, Equation 29 is rearranged as follows: 



-2&OV2) -AG' r (2) AC'r(l) 

0=e « r +2e OT -Cj'"'"e « r +1 (30) 
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AG° T can then be decomposed as AG° T =AH°- T AS 0 to obtain: 



-2A//»(2) +2AS°(2) -A//°(2) + AS°(2) -Atf°U) + AS°(1) 

0=e RT e R +2e RT e * -C^e RT e R +1 (31) 

This equation can be solved by bisection to afford the net melting temperature. 

An experimentally validated example of the accuracy of the net 
5 hybridization thermodynamics is shown in Figure 8 for molecular beacons. At the 
top of Figure 8 are the predicted thermodynamics for simple duplex formation 
assuming no competing single strand secondary structure. Using Module 1 of the 
invention, these results are similar to what would be predicted using other 
commercial software (such as oligo 6.0), though our thermodynamic database 

10 includes the dangling end effects and salt corrections are more accurate than other 
software. The middle of Figure 8 shows the single strand folding at the molecular 
beacon as output from DNA-MFOLD. The bottom table of Figure 8 shows the 
experimentally determined A6 (effective) and Tm (effective) published in Bonnet et 
al. 1999, as well as the effective Tm and A6 (effective) predicted with Module 1 

15 using the coupled equilibria calculations. Note the close agreement between 
experiments and predictions in the bottom table and the disagreement between 
experiments and the predictions using the naive simple hybridization calculation (top 
table of Figure 8). Also note the good agreement in the bottom table for the fully 
matched A-T sequence and mismatch A- A, A-C, and A-6 sequences, thus validating 

20 the mismatch parameters. 

Further, the net hybridization calculations can be extended to 
different temperatures as shown in Figure 9, to reveal how the concentrations of all 
species change with temperature. Given the extinction coefficients and fluorescence 
quantum yields, the concentration vs. temperature profiles shown in Figure 9 can 
25 be used to calculate the fluorescence vs. temperature profile (not shown), thereby 
allowing the prediction of the temperature which produces the maximum 
fluorescence signal and minimum background fluorescence signal. 
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Another manifestation of the concentration calculations is for match 
vs. mismatch discrimination (Figure 10), whereby the concentrations of all species 
at all temperatures can be calculated (Figure 11). For the particular case shown, 
optimal match vs. mismatch discrimination is predicted to occur at 0°C. The 
5 concentration calculations can be generalized for cases in which molecules can form 
many different competing unimolecular, biomolecular, and higher order complexes 
(Figure 12) using generalized equations such as shown in Figure 13 for the five 
molecule case, and solved using the algorithm in Figure 7. 

Algorithm 

10 The hybridization prediction algorithm of the present invention is 

based on a nearest-neighbor-model analysis of the sequences. The algorithm 
accounts for structural motifs including Watson-Crick base pairs (Allawi and 
SantaLucia, 1997; SantaLucia, 1998; Sugimoto et al., 1995; Xia et aL, 1998), 
single internal mismatches (Allawi and SantaLucia, 1997; Allawi and SantaLucia, 

15 1998; Allawi and SantaLucia, 1998; Allawi and SantaLucia, 1998; Kierzek et al., 
1999; Peyret et aL, 1999; SantaLucia, 1998), double mismatches (Allawi and 
SantaLucia, 1997) coaxial-stacking interfaces (disclosed herein) (Walter and Turner, 
1994), terminal mismatches (disclosed herein) (Freier et al., 1986) and dangling 
ends (Bommarito et al., 2000; Freier et al., 1986). Once the motifs are identified 

20 and their thermodynamic contributions are added, the sum may be corrected for salt 
effects (sodium and magnesium) and the net hybridization is calculated when 
appropriate. 

Algorithm Functions 

A first or main module of the algorithm calculates the hybridization 
25 thermodynamics (AH°, AS°, AG° 37 , T M ) of a given duplex. Net hybridization 
accounting for secondary structure in both strands is also calculated. 
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Parameterization 

Parameters are organized in three arrays. The first array contains 
internal element parameters: Watson-Crick nearest neighbors and single mismatch 
nearest neighbors. The second array contains terminal element parameters: terminal 
5 mismatches and dangling ends. A single parameter is used to account for double 
mismatches except for tandem G*T mismatches, which are explicitly enumerated 
(Allawi & SantaLucia, 1997). The third array contains coaxial-stacking parameters 
(contained herein). 

For DNA sequences, the thermodynamic contribution of all Watson- 
10 Crick nearest neighbors and single internal mismatches has been systematically 
studied (Allawi and SantaLucia, 1997; Allawi and SantaLucia, 1998; Allawi and 
SantaLucia, 1998; Allawi and SantaLucia, 1998; Peyret et al., 1999). A limited 
number of sequences containing double mismatches has also been studied (Allawi 
and SantaLucia, 1997). The contributions of dangling ends (Bommarito et al., 
15 2000) have also been systematically analyzed. Salt corrections are available for 
sodium in the range 0.01 to 1 M (SantaLucia, 1998). 

For RNA sequences, the thermodynamic contribution of all Watson- 
Crick nearest neighbors has been systematically studied (Xia et al., 1998). A limited 
number of sequences containing single mismatches has also been studied (Kierzek 
20 et al., 1999). The contribution of dangling ends and terminal mismatches has also 
been systematically analyzed (Freier et al., 1986). No salt correction has been 
developed for RNA and therefore the DNA salt corrections are assumed. These 
corrections are likely to be deficient in the case of RNA. 

For DNA/RNA hybrids, the thermodynamic contribution of all 
25 Watson-Crick nearest neighbors has been systematically studied as well as a limited 
number of sequences containing single mismatches (Sugimoto et al., 1995). As no 
salt correction has been developed for DNA/RNA hybrids, the DNA corrections are 
assumed. The applicability of these corrections to DNA/RNA hybrids has not been 
tested. The parameter arrays are designed to easily accommodate implementation 
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of new parameters and salt corrections including thermodynamics parameters for 
modified bases and denaturant effects. 

Correction for Hybridization to DNA Microchips 

A linear correction of the free energy is implemented in the algorithm 
5 of the invention to correct for hybridization to DNA microchips: 

AG° 37 (microchip) = aAG° 37 (sohition) +b (32) 

where a and b are user defined real coefficients. Fotin et al. (Fotin et al., 1998) 
showed that a linear relationship could be used to relate the free energies obtained 
for hybridization in solution and on microchip surfaces. However, the relation 
10 between thermodynamics measured in solution and thermodynamics measured using 
microarrays is still unclear and appears to be different depending on the 
manufacture and type of microarrays. 

User Interface: Input and Output 

Figure 2a shows the user interface input. The users enter the 
15 sequence of each strand, the hybridization conditions (hybridization temperature, 
strand concentrations, and monovalent cations and concentrations), and 
thermodynamic corrections for single strand folding. Figure 2b shows the output 
corresponding to the input in Figure 2a. 

The algorithm can be used via the Internet at: 
20 http://jsll.chem.wayne.edu/Hvther/hvthermlmain.html . The algorithm may be 
written in FORTRAN 77 and run on UNIX environment or other languages and 
environments. 
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Molecular Beacons 

The algorithm may be used to predict the thermodynamics of a set 
of literature measurements for molecular beacons (Bonnet et al., 1999). Molecular 
beacons are high specificity probes that are efficient for mutation analysis 
5 (Giensendorf et al., 1998) and multiplex detection of single nucleotide variations 
(Marras et al., 1999). The design and efficiency optimization of these beacons is 
helped by hybridization thermodynamics prediction. Bonnet et al. studied, the 
hybridization of the molecular beacon 5 CGC, TCC, CAA, AAA, AAA, AAA, 
CCG AGC G 3 ' to a set of four different targets including a perfect match duplex, 
10 and three different duplexes containing one mismatch. Free energy and enthalpy for 
duplex folding may be calculated using the DNA MFOLD program 
(http://mfold2.wustl.edu/-mfold/dna/forml.cgi). These parameters may then 
incorporated as secondary structure corrections in Figure 2a. 

The software to implement the algorithm may be written in 
15 FORTRAN, C ++ , Visual Basic, HTML, and JAVA script computer languages. 
Two graphical user interfaces may be provided: Windows application and web 
browser format. The software may run on IBM/PC, Sun, and Silicon Graphics 
platforms. 

The software may be written in several modules as described below. 

20 A. Interactive Mode: Command Line Interface in MS-DOS 

MODULE 1 (As Previously Described above) 

Function . Module 1 predicts the hybridization thermodynamics of a given 
duplex (DNA/DNA, RNA/RNA, or DNA/RNA). 
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Input (Figure 2a) 

Input of Sequences 

1. Only the following characters are accepted: A, a, C, c, G, g, 
T, t, U, u, /, *, +. Single blank characters and numbers will be automatically 

5 edited, but more than one carriage return is not permitted. 

2. If the duplex contains a dangling end on a strand, the 
sequence of the other strand should contain a * at the corresponding position. (This 
is very important to include for primer binding to a large target sequence). Note; 
The top strand must be entered in 5* to 3' orientation, but the bottom strand must 

10 be entered in 3' to 5 1 orientation. Also, a 44 + " must be added at the end of each 
sequence. There is a length limit of 1024 characters for sequence entries. In 
module 1, it is important to be sure that both sequences have the same length. 

Example: AAA ACCCCTGA + 
*TTTGGGGAC*+ 

15 3. Only the bottom strand may contain coaxially stacked 

nucleotides. A "/" should be inserted at the site of a strand nick (i.e. between the 
coaxially stacked nucleotides). This feature is useful for predicting stacked 
hybridization stability. 

Example: AAAACCCCC + 
20 TTTT/GGGG+ 

Input of Salt and Strand Concentrations 

The monovalent salt should be the sum of all monovalent cation 
concentrations in a solution in units of molarity. For example, a solution of 100 
mM KC1, 50 mM NaCl, 10 mM Na 2 P0 4 , 0. 1 mM Na 2 EDTA would account for a 
25 total of 0. 1702 M monovalent. The thermodynamic predictions are applicable over 
a salt range of 0.01 to 1 M monovalent cation. The correction applied is from 
SantaLucia (1998) Proc. Natl Acad. Sci. 95, 1460. The sodium correction applies 
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for oligonucleotides with fewer than about 30 base pairs. For longer duplexes a 
polymer correction is required, but this is not currently implemented. 

Strand concentrations are entered in units of molarity. The program 
will accept virtually any physically relevant strand concentration. 

5 Hybridization temperature is in Celsius degrees. The limits are 0 to 

100 degrees. 

Special corrections for single-stranded secondary structure and for 
surface corrections for hybridization arrays can be input. The units for input AG° 
are kcal/mol. To determine estimates of single-strand folding energies, see Michael 
10 Zuker's RN A or DN A-MFOLD servers (see http: 
//mfold2. wsutl.edu/-mfold/dna/forml.cgi). The current thermodynamic prediction 
software incorporates the special corrections for single-stranded secondary structure 
and for surface corrections for hybridization arrays. 

For DNA chip arrays, a linear correction can be applied. The user 
15 inputs the slope and intercept coefficients. Based on the work of Mirzabekov 
group, a slope of +1.1 and intercept of +3.2 are appropriate (see Fotin et al. 
(1998) Nucleic Acids Res. 26, 1515-1521). 

Output (Figure 2b) 

Module 1 outputs the hybridization thermodynamics at 1.0 M NaCl 
20 and 37°C (the conditions under which the thermodynamic predictions are most 
accurate), under the salt temperature conditions specified by the user, and also 
displays the net hybridization Tm and AG 0 if the user specifies that special 
corrections are needed (this allows for single-strand secondary structure of both the 
target and probe DNA to be accounted and for surface effects of chip arrays). 
25 Predictions of AG°, AH°, AS°, and Tm are provided. 
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MODULE 2 

Function . Module 2 finds the best primers of given length 
complementary to a long target nucleic acid. DNA/DNA, RNA/RNA, DNA/RNA 
hybridization types are accepted. The user selects the number of primers to output, 
5 and the program finds the most stable primers and gives their hybridization position 
and thermodynamics of each primer. 

Input (Fimre 3a) 

Input of Salt and Strand Concentrations 

The input of strand and salt concentrations is similar to Module 1 . 

10 Input of Sequences 

The target sequence is input as in Module 1. 

Output (Figure 3b) 

Primer Length and Number of Best Primers 
Module 2 displays "number of best primers" best primers of length 
15 "primer length" in order of decreasing stability. 

Output 

Module 2 outputs "number of best primers" best primers of length 
"primer length" in order of decreasing stability along with their hybridization 
thermodynamics. 

20 MODULE 3 

Function . Module 3 walks a given primer along a given target and 
finds the thermodynamics for the best target/primer complex and for the competitive 
target/primer complexes: DNA/DNA, RNA/RNA, DNA/RNA, hybridization types 
are accepted. 
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Input (Figure 4a) 

Input of Sequences 

The input is similar to Module 1. The target has to be longer than 

the primer. 

5 Input of Salt and Strand Concentrations 

The input of salt and strand concentrations is similar to Module 1. 

Percent Stability p of Alternative Binding Sites 
Compared to the Most Stable Binding Site 

This parameter excludes all competitive sites that are not within the 

10 defined percent of the best primer stability. If the best primer stability is -5 

kcal/mol and p = 10 then any competitive site of energy higher than -5 + (10/100*5) 

= -4.5 kcal/mol will not be displayed. 

Number of Base Pairs Required to Compute the Solution 
This parameter excludes all competitive sites that contain less 
15 Watson-Crick base pairs than the defined value. 

Output (Figure 4b) 

Module 3 outputs the best primer binding site and the competitive 
binding sites that pass the filtering criteria (percent stability p of alternative binding 
sites compared to the most stable binding site and number of best primers). 

20 MODULE 4 

Function , Batch mode calculations (see below). 
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MODULE 5 

Function . Module 5 is a combination of Modules 2 and 3 and finds 
the n best primers of given length complementary to a given section of a target and 
display the thermodynamics of the target/primer system(s). Then, each best primer 
5 is walked along the whole target to find the competitive hybridization sites. The 
thermodynamics of the target/primer systems at these alternative sites is then 
displayed. DNA/DNA, RNA/RNA, DNA/RNA, hybridization types are accepted. 

Input (Figure 5a) 

Input of Sequences 
10 The target sequence is input as in Module 1 . 

Input of Salt and Strand Concentrations 

The input of salt and strand concentrations is similar to Module 1. 

Sequence Section Where to Find the Best Primers 
Module 5 finds the best primers in the target region ranking from 
15 "position of initial nucleotide" to "position of final nucleotide" . Note that Module 
5 then looks for competitive sites of each best primers in the whole target. 

Percent Stability of Alternative Binding Sites 
Compared to the Most Stable Binding Site 

The function of this parameter is the same as in Module 3. This 

20 parameter is input for each best primer corresponding to the "number of best 

primer" specified. 

Number of Base Pairs Required to Compute the Solution 
The function of this parameter is the same as in Module 3. This 
parameter is input for each best primer corresponding to the "number of best 
25 primer" specified. 
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Output (Fieure 5b) 

Primer Length and Number of Best Primers 
Module 5 displays "number of best primers" best primers of length 
"primer length" by order of decreasing stability. 

5 Output 

Module 5 displays "number of best primers" best primers and their 
competitive sites by order of stability along with their hybridization 
thermodynamics . The best primer and its ranked competitive hybridization sites are 
listed first. Then, the second best primer is listed with its competitive hybridization 
10 sites. 

MODULE 6 

Function , Module 6 is similar to Module 3 and walks a given primer 
along a given target and finds the thermodynamics for the best target/primer 
complex and for the competitive target/primer complexes: DNA/DNA, RNA/RNA, 
15 DNA/RNA, hybridization types are accepted. Then, Module 6 simulates the 
concentration of every species at every degree from 1 to 100 °C, as illustrated in 
Figure 6. 

Input (Not Shown) 

Input of Sequences 

20 The input is similar to Module 1. The target has to be longer than 

the primer. 

Input of Salt and Strand Concentrations 

The input of salt and strand concentrations is similar to Module 1. 
Percent Stability p of Alternative Binding Sites 
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Compared to the Most Stable Binding Site 

This parameter excludes all competitive sites that are not within the 
defined percent of the best primer stability. If the best primer stability is -5 
kcal/mol and p=10, then any competitive site of energy higher than: -5 + 
5 (10/100*5) = -4.5 kcal/mol will not be displayed. 

Number of Base Pairs Required to Compute the Solution 
This parameter excludes all competitive sites that contain less 
Watson-Crick base pairs than the defined value. 

Correction for Target/Target Interaction, Target folding, 
10 Primer/Primer Interaction and Primer Folding 

The user is asked if he wants to correct for the interactions above. 

If the answer is "y", the user is prompted for AH° 37 corresponding to the 

interaction. Secondary structure thermodynamics can be determined using the 

Zuker algorithm as discussed in Module 1 section. 

15 Output (Not Shown) 

Concentration Output Filename 

The results from the concentration simulations (concentration of 
species at every temperature) are saved in this file. 

Output 

20 Module 6 outputs the best primer binding site and the competitive 

binding sites that pass the filtering criteria (percent stability p of alternative binding 
sites compared to the most stable binding site and number of best primers). The 
concentration simulations are saved in a file specified by the user. 



-31- 



WO 01/94611 



PCT/US01/18424 



MODULE 7 

Function . Module 7 is a combination of Modules 2 and 5 and finds 
the n best primers of given length complementary to a given section of a target and 
display the thermodynamics of the target/primer system(s). Then, each best primer 
5 is walked along the whole target to find the competitive hybridization sites. The 
thermodynamics of the target/primer systems at these alternative sites is then 
displayed. DNA/DNA, RNA/RNA, DNA/RNA hybridization types are accepted. 
Then, Module 7, like Module 6, simulates the concentration of every species at 
every degree from 1 to 100°C, as illustrated in Figure 6. 

10 Input (Not Shown) 

Input of Sequences 

The target sequence is input as in Module 1 . 
Input of Salt and Strand Concentrations 

The input of salt and strand concentrations is similar to Module 1. 

15 Sequence Section Where to Find the Best Primers 

Module 7 finds best primers in the target region ranking from 
"position of initial nucleotide" to "position of final nucleotide. " Note that Module 
7 then looks for competitive sites of each best primers in the whole target. 

Percent Stability of Alternative Binding Sites 
20 Compared to the Most Stable Binding Site 

The function of this parameter is the same as in Module 3. This 

parameter is input for each best primer corresponding to the "number of best 

primer" specified. 
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Number of Base Pairs Required to Compute the Solution 

The function of this parameter is the same as in Module 3. This 

parameter is input for each best primer corresponding to the "number of best 

primer" specified. 

5 Correction for Target/Target Interaction, Target folding, 
Primer/Primer Interaction and Primer Folding 

For each best primer, the user is asked if he wants to correct for the 

interactions above. If the answer is a y M , the user is prompted for AH° and AG° 37 

corresponding to the interaction. Secondary structure thermodynamics can be 

10 determined using the Zuker algorithm as discussed in Module 1 section. 

Concentration Output Filenames 

For each best primer, the results from the concentration simulations 
(concentration of species at every temperature) are saved in this file. The user has 
to select a different filename for each best primer. 

15 Output (Not Shown) 

Output 

Primer Length and Number of Best Primers 

Module 7 displays "number of best primers" best primers of length 
"primer length" by order of decreasing stability. 

20 Module 7 displays "number of best primers" best primers and their 

competitive sites by order of stability along with their hybridization 
thermodynamics. The best primer and its ranked competitive hybridization sites are 
listed first. Then, the second best primer is listed with its competitive hybridization 
sites. For each best primer, a file named by the user contains the concentration 

25 simulations. 
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Module 7 allows the user to design optimal primers for applications 
where multiple simultaneous hybridization reactions are occurring, including match 
vs. mismatch hybridization, molecular beacons, DNA oligonucleotide arrays, and 
multiplex PGR. 

5 One commercially important example for the use of Module 7 for 

primer design in a complex hybridization solution is Multiplex PCR, as shown in 
Figure 7. Module 7 allows the user to design optimal primers for Multiplex PCR 
where multiple primers have equal stabilities in binding to the target DNA. Several 
primers must be designed to specifically bind to different sites on target DNA at a 
10 given temperature with minimal background binding to mismatch sites and with 
minimal cross-hybridization between pairs of primers. 

Module 7 minimizes potential primer dimer formation and mismatch 
hybridization for all combinations of input primers. Module 7 optimizes primer 
sequence position, length, and concentration for each primer in relation to all other 
15 species in solution and provides a hybridization profile at all temperatures from 0 
to 100°C. 

Batch Mode 

MODULE 4 

Function , Module 4 allows any of the previous modules to be run 
20 in batch mode using text files to submit the input and having the data output as text 
files also. 

Type of Input Files 

There are two types of input files: 1) parameter input file, and 2) 
sequence input file. Parameter input files describe what modules to run with what 
25 hybridization parameters and on how many sequences to run them. Example of 
parameter input files for each module with comments are given in the "Batch mode 
parameter files folder." Sequence files contain the sequences that are going to be 
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hybridized in the conditions described by the parameter input files. Examples of 
parameter input files for each module with comments are given in the "Batch mode 
sequence files folder." 

Note that a parameter file can successively run different modules on 
5 various different sequences. 

The user is successively asked for the names of the parameter input 
file, the sequence input file and the thermodynamic data output file. Note that these 
files have to be in the directory containing the executable version of the software. 
Output files will also be created in this same directory. Names of the concentration 
10 simulation files are specified in the parameter input files. 

Examples of Batch Mode Parameter Files 

Comments in parentheses describe the meaning of each entry (note 
that an actual parameter file must not contain these comments). 

DUP (Module 1: Simple duplex calculations) 

15 1 (Number of sequences to apply this parameter file to) 

1 (Monovalent cations concentration mol/L)) 

1 Mg 2+ concentration mol/L) 

37.0 (Hybridization temperature) 

4e-4 (Top strand concentration mol/L) 

20 4e-4 (Bottom strand concentration mol/L) 

1 (Correction for microchips: slope) 

0 (Correction for microchips: intercept) 

0 (Correction for top strand folding: AG° 37 ) 

0 (Correction for top strand folding: AH° 37 ) 

25 0 (Correction for bottom strand folding: AG° 37 ) 

0 (Correction for bottom strand folding: AH° 37 ) 

END (End of file required) 
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(Number of sequences to apply this parameter file to) 


15 


1 


(Monovalent cations concentration moi/i^; 




1 


(Mg concentration moi/jL; 




^7 

j 1 


^riyDriuiZaiion iciupcrdiurc; 




4e-4 


(Top strand concentration mol/L) 




4e-4 


(Bottom strand concentration mol/L) 


zu 


90 


(Percent window of best primer stability for alternative sites) 




2 


(Number of WC base pairs required to compute the solution) 




cgcg+ (Primer sequence, + required) 




1 


(Correction for microchips: slope) 




0 


(Correction for microchips: intercept) 


25 


END 


(End of file required) 



BPW (Module 5 displays "number of best primers" best primers and their 
competitive sites by order of stability along with their hybridization 
thermodynamics) 

1 (Number of sequences to apply this parameter file to) 
30 1 (Lower limit of primer search area) 

10 (Upper limit of primer search area) 



-36- 



WO 01/94611 PCTAJS01/18424 

1 Mg 2+ concentration mol/L) 

37 (Hybridization temperature) 

4e-4 (Top strand concentration mol/L) 

4e-4 (Bottom strand concentration mol/L) 

5 4 (Primer length) 

1 (Number of best primers) 

1 (Correction for microchips: slope) 

0 (Correction for microchips: intercept) 

800 (Percent window of best primer stability for alternative sites) 

10 2 (Number of WC base pairs required to compute the solution) 

END (End of file required) 

PWC (Module 6 primer walk with concentration calculations) 

1 (Number of sequences to apply this parameter file to) 
1 (Monovalent cations concentration mol/L) 

15 1 Mg 2+ concentration mol/L) 

37 (Hybridization temperature) 

4e-4 (Top strand concentration mol/L) 

4e-4 (Bottom strand concentration mol/L) 

90 (Percent window of best primer stability for alternative sites) 

20 2 (Number of WC base pairs required to compute the solution) 

cgcg+ (Primer sequence, + required) 

1 (Correction for microchips: slope) 

0 (Correction for microchips: intercept) 

0 (Correction for target folding: AG° 37 ) 

25 0 (Correction for target folding: AH°) 

0 (Correction for target/target interaction: AG° 37 ) 

0 (Correction for target/target interaction: AH 0 ) 

0 (Correction for primer folding: AG° 37 ) 

0 (Correction for primer folding: AH°) 

30 0 (Correction for primer/primer interaction: AG° 37 ) 

0 (Correction for primer/primer interaction: AH°) 

outconc (Concentration output file name) 
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END (End of file required) 
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fRnttnm ctrnnH pnnppritrjitinTi mnl/T ^ 

^U\J HUlll oLiuJ-LU WU-UVwllLl d LIU 11 1UU1/ X-ij 






/T^rimpr IpncrtVA 

\1 llillvl ICllgLUy 




1 

l 


fNiimhpr nf hpQt nrimpr^ 

l^J.1 UlllUt-1 \JL UwOL pililltsl OJ 




1 
1 


^^nrrppfinn fnr mir*rnf*hinc* dnnp^ 

^^UllCdlUIl 1U1 liilUi UULlipo . oiupcy 




o 


^r^nrrpptiAn fnr miprnphinQ* intprppnA 

^V^UllGdlUll 1U1 Ulld Ul>llipd* llllVlCwpiy 




800 


^Pprrj^nt winHnw nf hp^t nrimpr <2tahi1itv fnr altprnativp QitPQ^ 
y± vi ^o^iiL wiuuuw ui uvo l uiii.tj.wi oiauiiikjr lyjk cutwiiiati v w oiluo y 




2 


fNumher nf AVP ha<ie nairs reauired tn comniite the solution^ 

yli tuiiut'l ui it v ucio^ P vUUii L\j wiiiuuiu uiw juiuuuuy 




0 


^Correctinn fnr target folding* AG°^ 




o 


^r^nrrectinn fnr target folding" ATT°^ 

\w v/1 1 WL1VI11 lUl IcUgWL lulUiUg> LXM.X J 


20 


o 


ffVirrpptinn fnr taropt/tarcpt intpraptinrv Afr 0 „.,^ 
\V^ui i v^ui/u iisi ten Leu gii uiiwi awtiv/ii» uu 27/ 




0 


(Correction for target/target interaction: AH 0 ) 




0 


(Correction for primer folding: AG° 37 ) 




n 
u 


^v^uircLiiun lor pruncr luiujuig. tin ^ 




o 


^r^nrrppHnn fnr nrimpr/nrimpr ititpraptinn • ACt 0 -.-^ 
^v--ui i cuLiuii iui pi uiit-i/ pi unci liiLviadiuu* Lx\j 27/ 




o 


^f^nrrpp rinn fnr nrimpr/nrimpr intprnpHnn ■ ATT°^ 
^ wui i cuiiuii xui pi uiici/pi miwi midciuiiuii. un ^ 




outconc 


(Concentration output file name) 




END 


(End of file required) 




PPW 


(Module 8: walk a primer along itself to find interaction sites. PWA 






with probe = primer) 


30 


1 


(Number of sequences to apply this parameter file to) 




1 


(Monovalent cations concentration mol/L) 
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1 

1 


ivig CUJiuCUU allUll jjA\JU Lsj 


37 


(Hybridization temperature) 


4e-4 


(Primer concentration mol/L) 


900 


(Percent window of best primer stability for alternative sites) 


2 


(Number of WC base pairs required to compute the solution) 


END 


(End of file required) 



Examples of Batch Mode Sequence Files 

For Module 1: Pup 
1 (Sequence number) 

10 agcgca-t- (Top strand sequence) 
tcgcgt+ (Bottom strand sequence) 

For Module 2: NBP 

1 (Sequence number) 

agcgca+ (Target sequence) 

15 For Module 3: 

1 (Sequence number) 

cgcctgcggccc+ (Target sequence) 

For Module 5: bpw 
1 (Sequence number) 

20 cgcctgcgccc+ (Target sequence) 

For Module 6: pwc 

1 (Sequence number) 

agcgca + (Target sequence) 

For Module 7: bwc 
25 1 (Sequence number) 

agcgca + (Target sequence) 
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For Module 8: ppw 
. 1 (Sequence number) 

agcgca+ (Primer sequence) 

Example of Batch Mode Parameter and Sequence Files 
5 to Run Different Modules Successively 



Parameter File : 

DUP (Executes Module 1) 

2 (Apply to Module 1 to 2 sequence sets) 

0.05 
10 1.5e-3 
37.0 
le-6 
2e-7 
1 

15 0. 

-2.12 
-37.3 
0 
0 

20 PWC (Executes Module 6) 

1 (Apply to Module 6 to 1 sequence set) 

0.16 

0.0025 

37 

25 10e-9 
le-9 
800 
8 

TCGAACGTAC + 
30 1 
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0 
0 
0 
0 

5 0 
0 
0 
0 
0 

10 outwash 

DUP (Executes Module 1) 

4 (Apply to Module 1 to 4 sequence sets) 

1 

0 

15 37.0 
le-6 
le-6 
1 
0 

20 0 
0 
0 
0 

END 

25 Other modules can be similarly appended. 
Sequence File 

1 (input for Module 1) 

ttgcctaggggaccagg tccaact + 
aacggatcccctggtccaggttga + 
30 2 (input for Module 1) 
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ttgcctaggggaccaggtccaact + 
aacggatcccctggtccaggttga + 
3 

C AGCTTGCATGA A A AGCTTGCGTGT + 
5 4 

AAAAAA+ 
TTTTTT+ 
5 

acgcgc-H 
10 tgcgcg+ 
6 

gggaaagggg+ 
*cctttccc*+ 
7 

15 tttaaattt+ 
aaatttaaa+ 
8 

cgcgtgagggcc+ 
gcgctctccccgg+ 

20 Parameterization of the Algorithm of the Invention 

Caution: RNA/RNA and DNA/RNA duplexes contain motifs for 
which no literature data are available. In these cases, DNA/DNA parameters are 
assumed. Therefore, predictions might be inaccurate. Users are encouraged to use 
this program with caution and discernment. 

25 No data are available for the following motifs: 

RNA/RNA single mismatches 
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(input for Module 1) 

(input for Module 1) 

(input for Module 1) 

(input for Module 1) 

(input for Module 1) 
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RNA/DNA single mismatches 
dangling ends 
terminal mismatches 
Single mismatches 

5 Double mismatch parameters are estimated for all types of duplexes 

(DNA/DNA, RNA/RNA, DNA/RNA). 





DNA THERMODYNAMIC PARAMETERS 




Watson-Crick nearest-neighbors 






10 


SantaLucia, Allawi, and Seneviratne 
(1996) Biochemistry 35, 3555; 
Allawi and SantaLucia (1997) 
Biochemistry 36, 10581 


12 parameters 


108 sequences 




Sinele mismatch nearest-neighbors 






15 


Allawi and SantaLucia (1997) 
Biochemistry 36, 100581; 
Allawi and SantaLucia (1997) 
Nucleic Acids Res., 26, 2694; Peyret 
et al., (1999) Biochemistry 38, 3468 


44 parameters 180 sequences 

Allawi and SantaLucia (1998) Biocliemistry 37, 2170 
Allawi and SantaLucia (1998) Biodiemistiy 37, 9435 


20 


Terminal Mismatch nearest- 
neiehbors 

(Appendix) 


48 parameters 


48 sequences 




Dangling end nearest-neighbors 






25 


S. Bommarito, Peyret, SantaLucia 
(2000) Nucleic Acids Res. 28, 1929- 
1934. 


16 parameters 


16 sequences 




Na + dependence law 








SantaLucia (1998) Proc. Natl Acad. 
Sci. USA 95, 1460-1465 


1 parameter 


86 sequences 
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UNA THERMODYNAMIC PARAMETERS 


Watson-Crick nearest-neiehbors 


12 parameters 


Xia et al. (1998) Biochemistry 37, 14719 


Single Mismatches 

Kierzek, Burkard, and Turner (1999) 
Biochemistry 38, 14214-14223 


44 parameters 


Terminal mismatch nearest-neishbors 


48 parameters 


Freier et al. (1986) Proc. Natl Acad. ScL USA 83, 
9373 


Daneline end nearest-neiehbors 


32 parameters 


Freier et al. (1986) Proc. Natl Acad. Sci. USA 83, 
9373 


Coaxial stacking nearest-neighbors 


16 parameters \ 


Walter and Turner (1994) Biochemistry 33, 12715 


Loop parameters 

Matthews et al. (1999) J. Mol Biol 288, 911-940 





HYBRID DNA/RNA THERMODYNAMIC PARAMETERS 


Watson-Crick nearest-neighbors 


17 parameters 


Sugimoto et al. (1995) Biochemistry 34, 11211 


rU»dG and rG*dT mismatches 




Sugimoto et al. (1997) Nucleic Acids Symp. Ser. 37, 199 
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DNA LOOP THERMODYNAMIC PARAMETERS 



Hairpins 

Hilbers et al. (1985) Biochie 67, 685-695 

Blommers et al. (1989) Biochemistry 28, 7491-7498 

Antao et al. (1991) Nucleic Acids Res. 19, 5901-5905 

Antao et al. (1991) Nucleic Acids Res. 20, 819-824 

Senior et al. (1988) Proc. Natl. Acad. Sci. USA 85, 6242-6246 

Bulges 

LeBlanc and Morden (1991) Biochemistry 30, 4042-4047 
Zieba et al. (1991) Biochemistry 30, 8018-8026 
Ke et al. (1995) Biochemistry 34, 4593-4600 
Turner, D.H. (1992) Curr. Opin. Struc. Biol. 2, 334-337 

Multibranched Loops 

Kadrmas et al. (1995) Nucleic Acids Res. 23, 2122 
Lilley and Hallam (1984) /. Mol. Biol. 180, 179-200 
Lu et al. (1991) J. Mol. Biol. 223, 781-789 
Ladbury et al. (1994) Biochemistry 33, 6828-6833 
Leontisetal. (1991) Nucleic Acids Res. 19, 759-766 

The parameters for multibranched loops are from a best fit analysis of 
secondary structure predictions vs. experiments as done by Jaeger et al. for 
RNA (Jaeger et al. (1989) PNAS 86, 7706-7710). The current parameters for 
multibranched loops neglect the sequence and complicated length dependence 
described by Leontis and coworkers, but approximate 4-way junctions fairly 
well. Implementation of more complicated rules will require modification of 
the MFOLD algorithm. 



While the best mode for carrying out the invention has been 
described in detail, those familiar with the art to which this invention relates will 
recognize various alternative designs and embodiments for practicing the invention 
as defined by the following claims. 
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Table 1: Thermodynamic Parameters for Duplex Formation in 1M NaCl a . 



ATGAGCTCAA 
AArTPGAGTA 


AH ob 
(kcal / mol) 

-56.7 ± 2.5 


AS ob 
(cal/molK) 

-155.8 ± 3.7 


(kcal / mol) 
-9.07 ±0.12 


(°C) 
52.3 


A A G A G C TCTA 
ATCTCGAGAA 


-55.1 ± 1.4 


-149.2 ± 3.8 


-8.91 ± 0.12 


56.0 


AGTAGCTACA 
AC ATCA ATGA 


-60.3 ± 1.8 


-165.8 ± 5.1 


-9.03 ±0.12 


54.5 


ACGATATCGA 

A R f T A T A G T A 

fjk VJ V*> X ii. X V^ £^ 


-67.8 ± 1.3 


-192.0 ± 3.5 


-8.87 ± 0.06 


49.4 


CT GAGC TCA£ 
CAfTPGAGTC 

Vv 1 w VJ *Y VJ X Jo 


-50.6 ± 1.2 


-136.9 ± 2.8 


-8.15 ± 0.11 


52.9 


£A GAGC T CT £ 
ctctpgaga r 

V^ A V/ A V^ VJ iT. VJ rX V^ 


-51.4 ± 1.3 


-137.5 ± 3.2 


-8.34 ± 0.14 


56.9 


£ G T A G C T A C £ 
CCATCGATGC 


-55.8 ± 1.8 


-154.6 ± 4.8 


-8.04 ±0.16 


49.7 


£CGATATCG£ 
CGCTAT AGCr 

v^, v* w a ii. x n vj Vv v^ 


-59.8 ± 1.2 


-166.9 ± 3.0 


-8.12 ± 0.07 


49.7 


£T GAGC TCAS 
GACTCGAGTfi 

>-» **• V* X v* VJ A VJ X yj 


-52.6 ± 1.3 


-140.7 ± 3.3 


-8.57 ± 0.13 


57.7 


fiAGAGCTCTfi 
STCTCGAGAfi 


-52.3 ± 1.4 


-142.5 ± 4.1 


-8.34 ± 0.08 


52.3 


fiGTAGC TACfi 
SCATCGATGfi 


-59.2 ± 1.0 


-164.2 ± 2.8 


-8.68 ± 0.07 


51.2 


SCGATATCGfi 
GGCTAT AGC£ 


-65.7 ± 0.9 


-183.6 ± 2.2 


-8.81 ± 0.07 


52.2 
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Table 1 : Continued 8 . 



ITGAGCTCAI 
IACTCGAGTI 


AH° b 
(kcal / mol) 

-55.4 ± 1.1 


AS ob 
(cal/mol K) 

-149.4 ± 3.0 


AG° 3r b 
(kcal / mol) 

-8.62 i 0.10 


(°C) 
56.9 


1AGAGCTCT1 
JTCTCG AGAJ 


-56.5 ± 1.3 


-154.3 ± 3.7 


-8.72 ± 0.09 


54.3 


IGTAGCTACI 
ICATCGATGJ 


-63.8 ± 0.8 


-178.0 ± 2.1 


-8.75 ± 0.06 


52.1 


ICGATATCG1 
IGCTAT AGCI 


-66.8 ± 0.6 


-187.9 ± 1.6 


-8.60 ± 0.04 


50.5 


CTGAGCTCAA 
AACTCG AGT£ 


-53.6 ± 1.3 


-145.8 ± 4.0 


-8.42 ± 0.06 


53.6 . 


AT GAGC T C A£ 
£ACTCGAGT A 


-54.0 ± 1.3 


-144.4 ± 3.2 


-8.92 ± 0.15 


58.8 


£GT AGC T AC A 
AC ATCGATG£ 


-56.8 ± 1.4 


-155.6 ± 3.5 


-8.53 ± 0.13 


53.7 


AGTAGCTAC£ 
£CATCG ATGA 


-57.1 ± 1.2 


-156.0 ± 3.0 


-8.71 ± 0.16 


54.4 


£CGAT ATCGA 
AGCTAT AGC £ 


-61.8 ± 0.5 


-172.7 ± 1.4 


-8.30 ± 0.03 


50.6 


ACGATATCG£ 
£GCTAT AGCA 


-58.3 ± 1.7 


• -158.5 ± 4.2 


-8.91 ± 0.17 


56.8 


£AGAGCTCTA 
ATCTCGAGA£ 


-54.6 ± 0.6 


-147.9 ± 1.4 


-8.66 ± 0.07 


55.4 
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Table 1: Continued 8 . 





AH ob 
(kcal / mol) 


AS ob 
(cal/molK) 


AG° 3 - b 
(kcal / mol) 


(°C) 


AAGAGCTCT£ 


-55.5 ± 1.4 


-150.4 ± 4.4 


-8.85 ± 0.08 


55.8 


£ T CTCGAGAA 










ITGAGCTCAC 


-52.2 ± 1.0 


-140.8 ± 2.4 


-8.38 ± 0.11 


54.9 


CACTCGAGTI 










£T GAGC TCAJ 


-55.1 ± 0.8 


-150.5 ± 1.9 


-8.42 ± 0.08 


53.2 


1ACTCGAGT£ 










1GTAGCTAC£ 


-58.0 ± 1.4 


-159.6 ± 3.7 


-8.39 ± 0.10 


52.9 


£C ATCG ATG1 










£GTAGC TACI 
1C ATCGATG£ 


-59.4 ± 0.9 


-164.6 ± 2.1 


-8.21 ± 0.06 


51.7 


1CGATATCG£ 
£ G C T A T AGCJ 


-61.7 ± 1.1 


-170.6 ± 3.2 


-8.33 ± 0.12 


53.7 


£CGAT ATCG2 

T r f T AT A Tt r r 


-57.9 ± 1.1 


-159.5 ± 2.7 


-8.11 ± 0.12 


52.7 


1AGAGCTCT£ 

^ * 1 V- VJ VJ 


-55.0 ± 1.4 


-148.0 ± 3.5 


-8.80 ± 0.12 


57.5 


CA GAGCTCT1 
TTCTCGAOAr 


-51.5 ± 1.1 


-137.9 ± 2.6 


-8.44 ±0.15 


56.4 


£T GAGC T C A A 
AACGCGAGTfi 


-54.2 ± 1.4 


-146.1 ± 3.3 


-8.77 ±0.16 


56.8 


ATGAGCTCAfi 
fiACTCGAGT A 


-55.4 ± 1.2 


-148.6 ± 3.1 


-9.03 ±0.14 


59.1 



49 



WO 01/94611 



PCTAJS01/18424 



Table 1 : Continued 8 . 













AH° b 


AS 


AG 37 


TV 












fkcal / mol) 


(cal / mol K) 


(kcal / mol) 


(°C) 


G G 


T A 


G C 


T A C 


A 


-59.4 i 1.6 


-163.3 ± 3.9 


-8.76 ± 0.15 


53.S 


AC 


A T 


C G 


AT G 


G 










a n 


T A 


G C 


T A C 


G 


-63.7 ± 1.6 


-174.9 ± 4.1 


-9.46 ± 0.18 


56.4 


GC 


AT 


C G 


AT G 


A 










n c 


G A 


T A 


T C G 


A 


-60.4 ± 0.8 


-166.6 ± 2.1 


-8.50 ± 0.09 


53.5 


AG 


C T 


AT 


A G C 












A C 


G A 


T A 


T C G 




-61.1 ± 1.3 


-167.5 ± 3.4 


-9.04 ±0.14 


55.8 


£G 


C T 


AT 


A G C 


A 










£ A 


G A 


G C 


T C T 


A 


-54.0 ± 1.1 


-144.9 ± 2.7 


-8.82 ±0.14 


57.9 


AT 


C T 


C G 


A G A 


G 










A A 


G A 


G C 


T C T 


Q. 


-54.8 ± 1.7 


-148.0 ± 5.0 


-8.90 ± 0.09 


56.3 


G T 


C T 


C G 


A G A 


A 






• 




T T 


G A 


G C 


T C A 


G_ 


-56.8 ± 0.7 


-155.5 ± 1.7 


-8.64 ± 0.06 


53.5 


£ A 


C T 


C G 


A GT 


1 










G T 


G A 


G C 


T C A 


1 


-57.4 ± 0.7 


-156.7 ± 2.0 


-8.80 ± 0.05 


54.7 


I A 


C T 


C G 


A G T 


a 










T G 


T A 


G C 


T A C 


G 


-59.2 ± 1.3 


-161.3 ± 3.6 


-8.93 ± 0.08 


56.2 


QC 


AT 


C G 


AT G 


I 










G_ G 


T A 


G C 


T A C 


I 


-64.8 ± 2.6 


-182.3 ± 7.2 


-8.63 ±0.17 


50.0 


1 C 


A T 


C G 


A T G 












I C 


G A 


T A 


T C G 




-63.3 ± 0.6 


-176.9 ± 1.6 


-8.42 ± 0.05 


51.2 


QC 


C T 


AT 


A G C 


I 










fiC 


G A 


T A 


T C G 


I 


-63.7 ± 0.9 


-177.4 ± 2.2 


-8.73 ± 0.12 


52.6 


I G 


C T 


A T 


A G C 
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Table I: Continued 11 . 





AH ob 


AS 


aH° b 
ciu 37 






(kcal / mol) 


(cal / mol KJ 




\ 


_I_ A U A U u I V-» 1 V£ 


-57 8 ± 0 6 


-157.4 ± 1.7 


-8.95 ± 0.03 


55.6 


STCTCGAGAl 










GAGAGCTCT1 


-57.3 ± 1.2 


-155.5 ± 3.3 


-8.95 ± 0.10 


56.4 


JTCTCGAGAfi 










Core seq\iences 










cgatatcg" 


-51.9 ± 0.6 


-145.3 ± 1.4 


-6.89 ± 0.09 


44.1 


GCTATAGC 










GTAGCTAC d 


-51.6 ± 0.6 


-143.7 ± 1.3 


-7.01 ± 0.08 


45.6 


CATCGATG 










AGAGCTCT 


-50.0 ± 0.7 


-136.5 ± 1.7 


-7.76 ± 0.06 


50.2 


TCTCGAGA 










TGAGCTCA 


-50.5 ± 0.5 


-137.7 ± 1.3 


-7.73 ± 0.04 


50.4 


ACTCGAGT 











a The top strand of each duplex is represented in the 5* to 3' orientation and the bottom strand 
is shown in the 3* to 5 ! direction. Terminal mismatch nearest neighbors are represented in bold. 

Mismatches are underlined. b AH 0 , AS 0 , and AG° 37 are the error-weighted averages of the 1/T M vs. 
In C T plot and curve fit methods in Table SI. Errors reflect the precision of the data (see text). 

c T M calculated using 10^ M total strand concentration. d Data from reference (19). 
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Table 2: Nearest-neighbor thermodynamic paramters of 



like-with-like base terminal mismatches in 1 M NaCl 


Dimer 


AH 0 6 


AS 00 


AG° 37 D 


Sequence* 


(kcal/mol) 


(e.u) 


(kcal/mol) 




Terminal A* A Mismatches 




A&J7A 


-3.1 ± 1.3 


-7.8 ± 2.0 


-0.67 ± 0.06 


TA/AA 


-2.5 ± 0.8 


-6.3 ±2.1 


-0.58 ± 0.07 


CA/GA 


-4.3 ± 1.0 


-10.7 ± 2.6 


-1.01 ±0.07 


GA/CA 


-8.0 ± 0.7 


-22.5 ± 1.9 


-0.99 ± 0.06 




Terminal OC Mismatches 




A£/T£ 


-0.1 ± 0.6 


0.5 ± 1.5 


-0.21 ± 0.06 


T£/A£ 


-0.7 ± 0.7 


-1.3 ± 1.8 


-0.29 ± 0.07 


C£/G£ 


-2.1 ± 0.9 


-5.1 ±2.5 


-0.52 ± 0.09 


G£/C£ 


-3.9 ± 0.7 


-10.6 ± 1.7 


-0.62 ± 0.06 




Terminal G'G Mismatches 




AQJTQ 


-1.1 ±0.7 


-2.1 ± 1.8 


-0.42 ± 0.07 


TQ/AQ 


-1.1 ±0.8 


-2.7 ± 2.2 


-0.29 ± 0.05 


CQIGQ 


-3.8 ± 0.6 


-9.5 ± 1.5 


-0.83 ± 0.05 


GQ/CQ 


-0.7 ± 0.5 


-19.2 ± 1.3 


-0.96 ± 0.06 




Terminal T«T Mismatches 




AJ/TI 


-2.4 ± 0.6 


-6.5 ± 1.6 


-0.45 ± 0.05 


TI/AI 


-3.2 ± 0.7 


-8.9 ± 2.1 


-0.48 ± 0.05 


CJ/GI 


-6.1 ± 0.5 


-16.9 ± 1.2 


-0.87 ± 0.05 


Gl/CI 


-7.4 ± 0.4 


-21.2 ± 1.1 


-0.86 ± 0.05 



a Thermodynamic parameters and their corresponding errors are 
calculated from Table 1 using equations 4 and 5. 
b Dimers are given in antiparallel orientation (e.g. A£/TA equals 
S'-AC-S 1 paired with 3*-TA-5'). Mismatches are underlined. 
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Table 3: Nearest-neighbor thermodynamic paia^i&iUQF 
mixed-base terminal mismatches in 1 M NaCl 



Dimer 




AS ob 


AG 0 3 7 b 


sequence 0 


(kcal/mol) 


(e.u) 


(kcal/mol) 




Tenninal A«C Mismatches 




AA/T£ 


-1.6 ± 0.7 


-4.0 ±2.1 


-0.35 ± 0.04 


A£/TA 


-1.8 ±0.7 


-3.8 ± 1.7 


-0.59 ± 0.08 


CA/G£ 


-2.6 ± 0.8 


-5.9 ± 1.8 


-0.76 ± 0.07 


CQ/GA 


-2.7 ± 0.7 


-6.0 i 1.6 


-0.85 ± 0.09 


GA/C£ 


-5.0 ± 0.4 


-13.8 ± 1.0 


-0.71 ± 0.05 


G£/CA 


-3.2 ± 0.9 


-7.1 ± 2.2 


-1.01 ±0.10 


TA/A£ 


-2.3 ± 0.5 


-5.9 ± 1.1 


-0.45 ± 0.05 


T£/AA 


-2.7 ± 0.8 


-7.0 ± 2.4 


-0.55 ± 0.05 




Terminal OT Mismatches 




AQ/TI 


-0.9 ± 0.5 


-1.7 ± 1.4 


-0.33 ± 0.06 


Al/T£ 


-2.3 ± 0.5 


-6.3 ± 1.2 


-0.35 ± 0.05 


CC/GI 


-3.2 ± 0.8 


-8.0 ± 2.0 


-0.69 ± 0.07 


d/G£ 


-3.9 ± 0.6 


-10.6 ± 1.2 


-0.60 ± 0.05 


GC/CI 


-4.9 ± 0.6 


-13.5 ± 1.7 


-0.72 ± 0.08 


GI/C£ 


-3.0 ± 0.6 


-7.8 ± 1.5 


-0.61 ± 0.08 


TC/AX 


-2.5 ± 0.8 


-6.3 ± 2.0 


-0.52 ± 0.07 


TX/AQ 


-0.7 ± 0.6 


-1.2 ± 1.6 


-0.34 ± 0.08 




Terminal G*A Mismatches 




AA/Tfi 


-1.9 ±0.7 


-4.4 ± 1.8 


-0.52 ± 0.08 


AQ/TA 


-2.5 ± 0.7 


-5.9 ± 1.7 


-0.65 ± 0.07 




-3.9 ± 0.8 


-9.6 ± 2.1 


-0.88 ± 0.09 


CQ/GA 


-6.0 ± 0.9 


-15.5 ± 2.1 


-1.23 ±0.1 


GA/Cfi 


-4.3 ± 0.5 


-11.1 ± 1.3 


-0.80 ± 0.06 


Gfi/CA 


-4.6 ± 0.7 


-11.4 ± 1.8 


-1.08 ±0.09 


TA/AQ 


-2.0 ± 0.7 


-4.7 ± 1.6 


-0.53 ± 0.07 


Tfi/AA 


-2.4 ± 0.9 


-5.8 ± 2.7 


-0.57 ± 0.05 
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Table 3: Continued 


Dimer 


AH ob 


AS"* 


AG° 37 b 


a 

sequence 


(kcal/mol) 


(e.u) 


/l>nni /rn nil 

(kcai/moi; 




Terminal G»T Mismatches 




AQ/TX 


-3.2 ± 0.4 


-8.7 ± 1.1 


-0.45 ± 0.04 


AI/TQ 


-3.5 ± 0.4 


-9.4 ± 1.2 


-0.54 ± 0.03 


Cfi/GI 


-3.8 ± 0.7 


-9.0 ± 1.9 


-0.96 ± 0.06 


CI/GS 


-6.6 ± 1.3 


-18.7 ± 3.6 


-0.81 ± 0.09 


GQ/CI 


-5.7 ± 0.4 


-15.9 ± 1.0 


-0.76 ± 0.05 


GJ/CQ 


-5.9 ± 0.5 


-16.1 ± 1.3 


-0.92 ± 0.07 


TQ/AX 


-3.9 ± 0.5 


-10.5 ± 1.2 


-0.59 ± 0.03 


TX/AQ 


-3.6 ± 0.7 


-9.8 ± 1.9 


-0.59 ± 0.06 



a Thermodynamic parameters and their corresponding errors 
are calculated from Table 1 using equations 4 and 5. 

b Dimers are given in antiparallel orientation (e.g. A£/TA equals 
5'-AC-3* paired with 3'-TA-5'). Mismatches are underlined. 
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Table SI : Thermodynamic Parameters for Duplex Formation in 1M NaCl 



ATGAGCTCAA 
AACTCGAGTA 

AAGAGCTCTA 
ATCTCGAGAA 

AGTAGCTACA 
ACATCAATGA 

ACGATATCGA 
AGCTAT A G C A 

£ T GAGC T C A C 
£ACTCGAGT£ 

£A GAGC T CT £ 
£ T CTCGAGA£ 

£GT AGC T AC £ 
£C ATCG ATG£ 

£CGAT ATCG£ 
£GCTAT AGC£ 

£ T GAGC TCAfi 
£ACTCGAGT£ 



AH 0 
(kcal / mol) 

-56.0 ±3.2 

-57.6 ± 4.0 

-54.6 ± 1.8 

-56.0 ± 2.3 

-59.9 ± 2.2 

-61.3 ± 3.3 

-60.5 ± 2.4 

-70.8 ± 1.5 

-52.3 ± 4.0 

-50.4 ± 1.2 

-55.6 ± 2.5 

-49.9 ± 1.5 

-55.1 ± 2.4 

-56.8 ± 2.7 

-57.3 ± 2.6 

-60.5 ± 1.3 

-55.2 ± 2.1 

-50.9 ± 1.7 



AS 0 
(cal/molK) 

-151.4 ± 10.1 
-156.5 ± 4.0 

-146.9 ± 5.3 
-15L6 ± 5.4 



AG°, 7 
(kcal / mol) 

-9.07 ±0.13 
-9.11 ± 0.46 

-8.91 ± 0.12 
-8.97 ± 0.64 



-164.0 ± 6.6 -9.03 ± 0.12 
-168.3 ± 7.9 -9.09 ± 0.83 



-166.5 ± 7.7 
-198.4 ± 3.9 



-8.86 ± 0.06 
-9.28 ± 0.34 



-142.4 ±12.7 -8.16 ±0.11 

-136.6 ± 2.8 -8.06 ± 0.36 

-152.1 ± 7.7 -8.36 ± 0.14 

-134.5 ± 3.5 -8.14 ± 0.45 



-151.8 ± 7.2 

-156.9 ± 6.6 

-158.4 ± 8.1 

-168.3 ± 3.3 

i 

-150.4 ± 6.4 

-137.1 ± 3.9 



-8.04 ± 0.16 
-8.08 ± 0.70 

-8.12 ± 0.07 
-8.21 ± 0.31 

-8.58 ± 0.14 
-8.38 ± 0.48 



T p 
(°C) 

57.0 
56.7 

57.7 
56.4 

55.4 
55.3 

54.3 
53.6 

52.4 
52.4 

52.8 
53.1 

50.9 
50.7 

50.8 
50.9 

54.2 
54.4 



fiAGAGCTCTfi c -56.8 ± 2.0 
£TCTCGAGA£ d -48.0 ± 1.9 



-155.1 ± 5.6 -8.72 ± 0.26 54.6 
-128.1 ± 5.9 -8.30 ± 0.09 54.9 
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Table SI: Continued. 











AH° 


AS' 




AG° J7 












(kcal / mol) 


(cal/molK) 


(kcal /mol) 


(°C) 




T A G C 


T AC £ 


C 


-56.9 ± 


1.5 


-155.4 ± 


4.8 


-8.67 ± 0.07 


54.2 


y c 


a t r* a 


a t r r 
A 1 u ^ 


d 


-61.1 ± 


1.4 


-168.5 ± 


3.4 


-8.83 ± 0.35 


53.9 


QC 


G A T A 


TCGfi 


c 


-62.4 ± 


2.4 


-172.9 ± 


7.4 


-8.79 ± 0.08 


53.3 


QG 


C T A T 


AGC £ 


d 


-66.2 ± 


0.9 


-184.7 ± 


2.3 


-8.93 ± 0.21 


53.0 


T T 


G A G C 


T C A Z 


c 


-56.8 ± 


1.5 


-155.1 ± 


4.7 


-8.63 ± 0.10 


54.2 


I A 


C T C G 


A G T X 


d 


-J J. / = 


1 7 

1. / 


« ± 
-Inj.D ■*- 




-8.52 ± 0.46 


54.4 






T P T T 


c 


-58.7 ± 


1.6 


-160.3 ± 


4.6 


-8.96 ±0.19 


55.4 


I T 


C T C G 


A G A I 


d 


-52.9 ± 


2. 1 


-142. J ± 


0.4 


-8.66 ±0.10 


55.6 


I G 


T A G C 


T A C X 


c 


-63.2 ± 


1.1 


-175.5 ± 


3.4 


O *7C _L t\ (\£L 

-3.75 ± U.Uo 


52.9 


1 c 


A T C G 


A T G X 


d 


£. A A l 

-64.4 ± 


1 1 

1.1 


1 TO A _L 

-179.4 ± 


2.0 


-8.80 ± 0.25 


52.8 


I C 


G A T A 


T C G X 


c 


-64.6 ± 


1.3 


-180.4 ± 


4.0 


o co _i_ r\ ac 
-8.59 ± U.U5 


ci n 
51.7 


I G 


C T A T 


A G C X 


d 


-67.4 ± 


0.7 


-189.4 ± 


1.8 


-8.69 ± 0.16 


51.5 


P T 


n a n p 

U A U t 


T P A A 

1 1\ £\ 


c 


-56.0 ± 


2.1 


-153.1 ± 


6.2 


-R SR 4- 0 1 S 

-O.Jo 3: U.IJ 


jj.y 


A A 


C T C G 


A G T £ 


d 




1 7 


-140.6 ± 


5.3 


-8.39 ± 0.07 


54.1 


A T 


n a n p 


T P A C 


c 


-57.0 ± 


2.4 


-154.9 ± 


7.1 


-o.y^ a: v.iQ 




£A 


C T C G 


A GT A 


d 


-52.7 ± 


1.6 


-141.7 ± 


3.6 


-8.73 ± 0.46 


56.1 


r g 


T A G C 


T A C A 


c 


-57.8 ± 


2.9 


-158.8 ± 


8.9 


-r + o m 




AC 


A T C G 


A T G £ 


d 


-56.5 ± 


1.6 


-155.0 ± 


3.8 


-8.46 ± 0.42 


53.0 


AG 


T A G C 


T A C £ 


c 


-59.4 ± 


4.0 


-163.2 ± 12.2 


-8.74 ± 0.18 


53.9 


£C 


A T C G 


A T G A 


d 


-56.8 ± 


1.3 


-155.5 ± 


3.1 


-8.60 ± 0.35 


53.8 


£C 


G A T A 


T C G A 


c 


-61.6 ± 


1.1 


-171.8 ± 


3.5 


-8.30 ± 0.03 


50.8 


AG 


C T A T 


A G C £ 


d 


-61.9 ± 


0.6 


-172.9 ± 


1.5 


-8.30 ±0.14 


50.7 
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Table SI: Continued. 











AH° 


AS 0 


AG°, 7 


t b 
T M 










(kcal / mol) 


(cal/molK) 


(kcal / mol) 


(°C) 


A C 


/-^ ATA 

G A T A 


T f~* f~* 

1 t u L 


C 


-61. o ± 


3. J 




4- 
X 


Q /% 


S QA. 


X 


0 1 Q 


54 3 


£G 


C T A T 


A G C A 


d 


-57.0 dt 


1.9 


-155.8 


X 


4.6 


-8.71 


X 


0.52 


54.4 


£ A 


G A G C 


T C T A 


c 


-55.7 ± 


1.1 


451.6 


X 


3.5 


-8.67 


X 


0.07 


54.6 


AT 


C T C G 


A G A £ 


d 


-54.2 ± 


0.7 


-147.2 


X 


1.6 


-8.59 


X 


0.18 


54.6 


A A 


G A G C 


T C T £ 


c 


-58.6 ± 


4.1 


-159.8 


± 


12.4 


-9.07 


X 


0.26 


56.1 


£T 


C T C G 


A G A A 


d 


-55.1 ± 


1.5 


-149.1 


X 


A "1 

4.7 


-8.83 


X 


A no 

0.08 


ceo 

55.8 


I T 


G A G C 


T C A £ 


c 


-54.1 ± 


1.9 


-147.4 


X 


5.8 


-8.40 


X 


0.12 


53.4 


£ A 


C T C G 


A G T I 


d 


-51.5 ± 


1.1 


-139.4 


X 


2.6 


-8.27 


X 


0.32 


53.5 


£ T 


G A G C 


T C A I 


c 


-57.3 ± 


4.5 


-157.5 


_i 
X 


14.3 


-8.43 


1 

X 


0.09 


52.7 


1 A 


C T C G 


A GT £ 


d 


-55.0 ± 


0.8 


-150.3 


X 


1.9 


-8.37 


X 


0.22 


53.0 


I G 


T A G C 


T A C £ 


c 


r 58.7 ± 


2.1 


-162.3 


X 


6.5 


-8.39 


X 


0.10 


52.0 




A T C G 


A T G X 


d 


-57.4 ± 


1.9 


-158.2 


X 


4.6 


-8.32 


X 


0.48 


52.0 


£ G 


T A G C 


T A C I 


c 


-59.3 ± 


1.6 


-160.7 


X 


5.1 


-8.21 


X 


0.07 


58.1 


I C 


A T C G 


A T G £ 


d 


-59.5 ± 


1.2 


-165.4 


X 


2.3 


-8.19 


X 


0.28 


50.7 


T C 


Cr A 1 A 


T C* 

I I b L 


c 


-62.8 ± 


1.2 


-175.7 


X 


3.9 


-8.34 


X 


0.12 


50.7 


£G 


C T A T 


AG C T 


d 


-57.7 ± 


2.4 


-159.6 


X 


5.7 


-8.16 




0.59 


51.0 




n A T A 


T C tZ T 


c 


-61.1 ± 


1.8 


-170.6 


X 


5.5 


-8.13 


X 


0.13 


5U.0 


I G 


C T A T 


A G C £ 


d 


-56.3 ± 


1.3 


-155.8 


X 


3.1 


-7.98 


X 


0.34 


50.2 


I A 


G A G C 


T C T £ 


c 


-56.4 ± 


2.0 


-153.5 


X 


6.3 


-8.81 


X 


0.12 


55.2 


£ T 


C T C G 


A G A I 


d 


-53.8 ± 


1.8 


-145.5 


X 


4.3 


-8.69 


X 


0.52 


55.4 


£ A 


G A G C 


T C T I 


c 


-54.8 ± 


2.1 


-149.4 


X 


6.4 


-8.47 


X 


0.17 


53.7 


I T 


C T C G 


A G A £ 


d 


-50.3 ± 


1.2 


-135.6 


X 


2.8 


-8.28 


X 


0.36 


53.9 
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Table SI: Continued. 







AH° 
(kcal / mol) 


AS 0 
(cal/molK) 


AG° 37 
(kcal / mol) 


(°C) 


fiTGAGCTCAA 
AACGCG AGTfi 


c 
d 


-56.6 ± 
-53.5 ± 


2.8 
1.6 


-154.3 ± 
-144.6 ± 


8.4 
3.6 


-8.79 ± 0.17 
-8.64 ± 0.44 


55.1 
55.2 


ATGAGCTCAfi 
fiACTCGAGTA 


c 
d 


-57.7 ± 
-53.8 ± 


1.9 
1.6 


-157.0 ± 
-145.1 ± 


5.6 
3.6 


-9.05 ±0.15 
-8.85 ± 0.45 


56.3 
56.5 


fiGTAGCTACA 
ACATCGATGfi 


c 
d 


-59.2 ± 
-59.4 ± 


4.2 
1.7 


-162.7 ± 
-163.3 ± 


12.9 
4.1 


-8.77 ±0.16 
-8.72 ± 0.44 


54.1 
53.8 


AGTAGCTACS 
fiCATCGATGA 


c 
d 


-63.3 ± 
-63.8 ± 


3.4 
1.8 


-173.4 ± 
-175.2 ± 


10.2 
4.5 


-9.46 ± 0.20 
-9.47 ± 0.45 


56.8 
56.6 


fiCGATATCGA 
AGCTATAGCG 


c 
d 


-62.1 ± 
-59.1 ± 


1.3 
1.1 


-172.8 ± 
-163.6 ± 


3.8 
2.6 


-8.51 ± 0.10 
-8.39 ± 0.27 


51.8 
51.9 


ACGATATCGfi 
fiGCTAT AGCA 


c 
d 


-63.1 ± 
-60.4 ± 


2.6 
1.6 


-174.1 ± 
-166.0 ± 


7.9 
3.7 


-9.06 ±0.15 
-8.92 ± 0.40 


54.6 
54.6 


fiAGAGCTCTA 
ATCTCGAGAS 


c 
d 


-56.4 ± 
-52.8 ± 


1.9 
1.3 


-153.5 ± 
-142.4 ± 


5.6 
3.1 


-8.84 ±0.15 
-8.67 ± 0.38 


55.4 
55.6 


AAGAGCTCTfi 
STCTCGAGAA 


c 
d 


-57.1 ± 
-53.1 ± 


2.6 
2.2 


-154.7 ± 
-142.8 ± 


7.6 
6.7 


-9.08 ± 0.20 
-8.86 ± 0.10 


56.7 
56.8 


ITGAGCTCAS 
GACTCGAGTJ 


c 
d 


-55.2 ± 
-57.2 ± 


1.5 
0.8 


-150.0 ± 
-156.4 ± 


4.5 
1.8 


-8.63 ± 0.07 
-8.71 ± 0.21 


54.5 
54.3 


fiTGAGCTCAI 
IACTCGAGTS 


c 
d 


-57.3 ± 
-57.6 ± 


0.7 
1.6 


-156.5 ± 
-157.4 ± 


2.3 
3.7 


-8.80 ± 0.05 
-8.81 ± 0.42 


54.9 
54.8 


IGTAGCTACfi 
fiCATCGATGI 


c 
d 


-59.9 ± 
-58.2 ± 


1.8 
2.0 


-164.5 ± 
-158.9 ± 


5.5 
4.8 


-8.93 ± 0.08 
-8.86 ± 0.53 


54.8 
55.0 


SGTAGCTACI 
ICATCGATGfi 


c 
d 


-62.8 ± 
-67.1 ± 


3.6 
3.8 


-174.8 ± 
-187.8 ± 


11.0 
9.4 


-8.62 ±0.17 
-8.79 ± 0.85 


52.3 
52.1 
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Table SI: Continued. 



ICGATATCGfi 
fiCCTAT AGC1 

fiCGATATCGl 
IGCTAT AG C Q 

1AGAGCTCT£ 
QT CTCG AGAI 

fiAGAGC TCTI 
ITCTCGAGAfi 



AH 
(kcal / mol) 

c -58.9 ± 2.8 

d -63.5 ± 0.6 

c -61.6 ± 3.7 

d -63.9 ± 0.9 

c -56.4 ± 1.5 

d -58.0 ± 0.6 

c -57.7 ± 1.6 

d -56.6 ± 1.9 



AS 0 
(cal / mol K) 

-163.0 ± 8.9 
-177.3 ± 1.6 

-170.4 ±11.5 
-177.6 ± 2.3 

-153.3 ± 4.5 
-158.1 ± 1.9 

-157.2 ± 4.7 
-153.9 ± 4.6 



AG° 37 
(kcal / mol) 

-8.40 ± 0.06 
-8.53 ± 0.15 

-8.71 ± 0.14 
-8.77 ± 0.22 

-8.89 ± 0.06 
-8.96 ± 0.03 

-8.95 ± 0.10 
-8.90 ± 0.52 



(°C) 

52.0 
51.6 

53.1 
52.8 

55.7 
55.6 

55.7 
55.7 



Core sequences 

C G AT A T C G « 
GCTAT AGC 

G T AGC T AC : 
CATCGATG 

AGAGC TCT 
T CTCGAGA 

TGAGCTCA 
ACTCGAGT 



c -55.7 ± 3.9 -157.1 ± 

d -51.8 ± 0.6 -145.1 ± 

c -55.1 ± 2.3 -155.0 ± 

d -51.4 ± 0.6 -143.3 ± 

c -49.5 ± 1.8 -134.5 ± 

d -50.1 ± 0.8 -136.7 ± 

e -50.7 ± 0.7 -138.4 ± 

d -50.3 ± 0.7 -137.3 ± 



12.1 -6.93 ± 0.12 44.1 

1.4 -6.82 ± 0.15 44.0 

7.0 -7.04 ± 0.10 44.9 

1.3 -6.95 ± 0.14 44.9 

5.7 -7.76 ± 0.07 50.6 

1.8 -7.76 ± 0.22 50.5 

2.2 -7.73 ± 0.04 50.1 

1.6 -7.72 ± 0.18 50.1 



a The top strand of each duplex is represented in the 5' to 3' orientation and the bottom strand 
is shown in the 3' to 5' direction. Terminal mismatch nearest neighbors are represented in bold. 
Mismatches are underlined. b T M calculated using 10" 4 M total strand concentration. 

c Thermodynamic parameters from averaging the fits of melting curves. Reported errors are 
standard deviations in the precision of the data. d Thermodynamic parameters from T M " vs. ln(C T ) 

plots. Reported errors are standard deviations in the precision propagated from the slope and 
intercept of the 1/T M vs. In C T plot. ' Data from reference (19). 
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Table 1 : Thermodynamic Parameters for Hairpin Oligomer Association and Oligomer Dupiex 
Formation. 

AH 0 AS 0 AGV t m 

(kcal/mol) (cai/moIK) (kcal moi) i°c, 

SvsiSinS ™ th Elementary Interfaces* 



C 



AAGCCTT GA-ACAACG c -50.7 * 4.1 -141.7 = 11.3 -6.74 = 0.27 49.2 
C GCGGAA C T / TGTTGC (i) 



r A AGC C TT G T - TC AACG 8 -52.8 = 4.2 -146.3 = 11.7 -7.43 = 0.30 53.1 
^•C GCGGAA CA/AGTTGC 



GCAACT - TGTTCCGA A-\ e -63.5 ± 5.1 -179.2 ± 14.3 -7.99 ± 0.32 53.2 
CGTTGA/ACAAGGCCC^ 



c 



c 



c 



c 



c 



c 



A AGC C TT G A - TCAACG e -53.6 ± 4.3 -149.3 ± 11.9 -7.34 = 0.29 52.3 

CGCGGAA CT / AGTTGC <iii> 

A AGC CTT GT - AC AACG c -45.1 ± 3.6 -124.8 ± 10.0 -6.42 * 0.26 48.4 
CGCGGAA CA/ TGTTGC 

A AGC C TT G C - AC AACG c -46.1 * 3.7 -128.5 ± 10.3 -6.26 = 0.25 46.9 
CGCGGAA CG / TGTTGC 

A AGC CTT GT -GCAACG e -52.2 ± 4.2 -144.4 = 11.5 -7.39 = 0.30 53.1 
CGCGGAACA/CGTTGC 

A AGC C TT GG - TCAACG e -53.6 = 4.3 -148.2 =11.9 -7.67 = 0.31 54.4 
CGCGGAA CC/ AGTTGC 

AAGCCTT GA - CCAACG * -51.3 = 4.1 -140.2=11.2 -7.81=0.31 56.2 
CGCGGAACT/GGTTGC 
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Table 1 : Continued. 

AH 0 AS 0 AGV T m 

(kcal/mol) (cal/molK) (kcal moh ( v *Ci 

r AAGCCTT GC - TCAACG e -46.1 i 3.7 -126.3 ± 10.1 -6.90 = 0.28 51." 
^CGCGGAA CG/AGTTGC 

A AGC C T T G A - GCAAC G e -48.2 ± 3.9 -131.7 ± 10.5 -7.32 ± 0.29 53.9 



c 



C GCGGAA C T / CGTTGC 



GC A AC A - G G T TCCGAA-v e -51.9 ± 4.2 -147.0 ± 11.8 -6.34 * 0.25 46.3 
CGTTGT / C CAAGGCC C-' 

AGC C T T G G - AC AAC G c -50.4 ± 4.0 -139.5 ± 11.2 -7.10 ± 0.28 51.7 
^C GCGGAA C C / TGTTGC 

rAAGCCTT GT - CCAACG c -54.2 ± 4.3 -147.9 ± 11.8 -8.29 ± 0.33 58.3 
^CGCGGAACA/GGTTGC 

.AAGCCTT GC-GCAACG c -47.6 ± 3.8 -130.9 ± 10.5 -6.96 ± 0.28 51.6 
^•CGCGGAACG/CGTTGC 

A AGC C T T G G - CCAACG e -52.1 ± 4.2 -140.1 ± 11.2 -8.67 * 0.35 61.9 
C GCGGAA C C / GGTTGC 

AAGCCTT GC - CCAACG c -53.3 ± 4.3 -146.5 ± 11.7 -7.92 ± 0.32 56.1 
CGCGGAACG/GGTTGC 

-AAGCCTTGG-GCAACG e -49.3 ± 3.9 -1352 ± 10.8 -7.35 ± 0.29 53.8 
^CGCGGAACC/CGTTGC 



c 



c 
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Table 1: Continued.. 



AH 0 AS 0 AG° 3 , T M 

(kcal/mol) (cal/molK) (kcal/mol) { X) 

Systems with Dangling Ends at th» Wrfar,> b 

^AAGCCTT GC-GCAACG e -44.4 * 3.6 -121.4 * 9.7 -6.76 * 0.27 51.1 

AAGCCTTGC-GCAACG ' -48.0 * 3.8 -131.8*10.5 -7.16 * 0.29 52.9 



CGCGGAA CG/CGTTGC 
A 



C GCGGAA C G / CGTTGC 

A 



c 

^AAGCCTTGC-GCAACG e -46.3 * 3.7 -127.4 * 10.2 -6.83 * 0.27 51.0 
^CGCGGAACG/CGTTGC 
T 

.-AAGCCTTGC-GCAACG e -49.0 * 3.9 -136.6 ± 10.9 -6.59 * 0.26 48.7 
^-C GCGGAA CG / CGTTGC 

T 

AAGCCTTGC-GCAACG c -37.6* 3.0 -102.0 * 8.2 -5.91 * 0.24 46.4 



CGCGGAACG/ CGTTGC 
A A 



c 

^-AAGCCTTGC-GCAACG e -36.2 ± 2.9 -97.3 * 7.8 -6.03 * 0.24 47.8 
^•C GCGGAA C G / CGTTGC 
T T 

^-AAGCCTTGC-GCAACG '-44.0*3.5 -123.5*9.9 -5.67 * 0.23 43.0 
^CGCGGAACG/CGTTGC 
A T 

^.AAGCCTTGC-GCAACG e -43.6 * 3:5 -119.5 * 9.6 -6.53*0.26 49.6 
^■C GCGGAA C G / CGTTGC 
T A 

> A A G C C T T G G - T C A A C G e -47.2 * 3.8 -129.8± 10.4 -6.90 * 0.28 51.3 
^•CGCGGAACC/ AGTTGC 
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Table 1: Continued. 



AH 0 AS 0 AG° 37 T M 

(kcal/mol) (cal/molK) (kcal/mol) ( °C) 

Systems with Extra Central NuclentiH* at the Interface" 

-AAGCCTTGCAGCAACG e -44.4 ± 3.6 -122.2 ± 9.8 -6.50 ± 0.26 49.2 
HGCGGAA CG / CGTTGC 

-AAGCCTTGTACCAACG 6 -45.0 ± 3.6 -124.6 ± 10.0 -6.32 ± 0.25 47.7 
^CGCGGAA CA/GGTTGC 

Oligomers 

TCAACG c -38.5 ± 3.1 -108.3 ± 8.7 -4.94 ± 0.20 38.0 
AGTTGC 



ACAACG e -36.1 ± 2.9 -99.3 ± 7.9 -5.32 ± 0.21 41.4 
TGTTGC 



GCAACG e -42.5 ± 3.4 -117.1 ± 9.4 -6.21 ± 0.25 47.5 
CGTTGC 



CCAACG 8 -38.6 ± 3.1 -1062 ± 8.5 -5.69*0.23 44.2 
GGTTGC 

TGTTGC c -37.1 ± 3.0 -101.1 ± 8.1 -5.79 ± 0.23 45.3 
ACAACG 



AGTTGC e -37.0 ± 3.0 -101.0 ± 8.1 -5.70 ± 0.23 44.2 
TCAACG 
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a T M calculated using 10"* total strand concentration. 

b The top strand of each system is conventionally represented in the 5' to 3' orientation. Nucleotides 
involved in coaxial stacking interfaces are represented in bold. 

c Parameters obtained by averaging the results of melt fit and TM" 1 vs. ln(C T /4) plot methods. 
Errors are estimated to be 8% for AH° and AS 0 and 4% for AG° 37 . 
(0. do. (iii) ^felting cur ves for these systems are shown in Figure 2. 
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Table 2: Thermodynamic Parameters for Coaxial Stacking*. 



AH°(coaxial stacking) 
(kcal/mol) 



AS°(coaxial stacking) AG°3 7 (coaxial stacking) 
(cal/molK) (kcal/mol) 



Flementarv Interfaces 

GA - AC 
CT / T G 

GT-TC 
C A / AG 

CT - TG 
GA / AC 

GA-TC 
CT / AG 

GT - AC 
CA / TG 

GC - AC 
CG / TG 

GT - GC 
CA / CG 

GG - TC 
CC / AG 

GA- CC 
CT /GG 

GC-TC 
CG / AG 

GA - GC 
CT / CG 

CA-GG 
GT / CC 

GG - AC 
CC / TG 



-14.6 ± 5.0 

-14.3 ± 5.2 

-26.6 ± 5.9 

-15.1 ± 5.3 

-9.0 ± 4.6 

-10.0 ± 4.7 

-9.6 ± 5.4 

-15.1 ± 5.3 

-12.7 ± 5.1 

-7.6 ± 4.8 

-5.6 ± 5.1 

-14.8 ± 5.1 

-14.2 ± 5.0 



-42.4 ± 13.8 

-38.0 ± 14.6 

-78.2 ± 16.5 

-41.0 ± 14.8 

-25.5 ± 12.8 

-29.2 ± 13.0 

-27J ± 14.9 

-39.9 ± 14.7 

-34.0 ± 14.1 

-18.0 ± 13.3 

-14.6 ± 14.1 

-45.9 ± 14.3 

-40.2 ± 13.7 



-1.42 ± 0.34 

-2.49 ± 0.36 

-2.29 ± 0.39 

-2.40 ± 0.35 

-1.10 ± 0.33 

-0.94 ± 0.33 

-1.18 ± 0.39 

-2.73 ± 0.36 

-2.12 ± 0.39 

-1.97 ± 0.34 

-1.11 ± 0.38 

-0.56 ± 034 

-1.78 ± 0.35 
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Table 2: Continued. 



AH°(coaxial stacking) AS°(coaxial stacking) AG°j 7 (coaxial stacking) 

(kcaJ/mol) " (cal/molK) (kcal/mol) 

GT-CC -15.6 ± 5.3 -41.8 ± 14.6 -2.61 ± 0.40 
CA / GG 

GC-GC -5.0 ± 5.1 -13.8 ± 14.1 -0.75 ± 0.37 
CG / CG 

GG-CC -13.5 ± 5.2 -33.9 ± 14.1 -2.98 ± 0.41 
CC / GG 

GC-CC -14.7 ± 5.3 -40.3 ± 14.5 -2.23 ± 0.39 
CG / GG 

GG-GC -6.8 ± 52 -18.1 ± 14.3 -1.14 ± 0.38 
CC/CG 



Interfaces with Dangling finds* 



GC-GC -1.9 ± 4.9 -4.3 ± 13.5 -0.55 ± 0.37 

CG/CG 
A 

GC-GC -5.5 ± 5.1 -14.7 ± 14.1 -0.95 ± 0.38 

CG / CG 
A 

GC-GC -3.8 ± 5.0 -10J ± 13.8 -0.62 ± 0.37 

CG / CG 
T 

GC-GC -6.4 ± 5.2 -19.5 ± 14.4 -0.38 ± 0.36 

CG / CG 
T 

GC-GC 5.0 ± 4.5 15.1 ± 12.4 0.30 ± 0.34 

CG / CG 
A A 
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Table 2: Continued. 



AH°(coaxial stacking) AS°(coaxial stacking) AG^coaxial stacking) 
(kcal / moi) (cal / mol K) (kcal / mol) 

GC-GC 6.3 ± 4.5 19.8 ± 12.2 0.18 ± 0.35 

CG / CG 
T T 

GC-GC -1.4 .± 4.9 -6.4 ± 13.6 0.55 ± 0.34 

CG / CG 
A T 

GC-GC -1.1 ± 4.9 -2.4 ± 13.4 -0.32 ± 0.36 

CG/CG 

T A 

A 

GG-TC -8.6 ± 4.9 -21.5 ± 13.5 -1.96 ± 0.34 

CC/AG 
A 

Interface with Extra Central Nucleotide* 

GCAGC -1.9 ± 4.9 -5.1 ± 13.5 -0.29 ± 0.36 
CG/CG 

GTACC -6.4 ± 4.7 -18.5 ± 13.1 -0.64 ± 0.34 
CA / GG 



1 These parameters and their corresponding errors are deduced from Table 1 as described in the text 
b The top strand of each duplex is conventionally represented in the 5' to 3* orientation. Nucleotides 
involved in coaxial stacking interfaces are represented in bold. 
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Table SI : Extinction coefficients of haiipins at 25 °C 



experimental' calculated? 
(LmorW) (LmorW) 



C 
C 
C 
C 
C 

c 
c 
c 
c 
c 
c 



AAGCC t tggtc aacg 

CGCGGAACC 


188847 


192310 


AAGCCTTGATCAACG 
C GCGGA ACT 


188718 


195950 


AAGCCTTGCTCAACG 
CGCGGAACG 


186139 ; 


191610 


AAGCCTTGTTCAACG 
C GCGGA ACA 


191071 


195810 


A AGCCT TGAACAACG 
C GCGGAACT 


194330 


200950 


AAGCCTTGTACAACG 
C GCGGA ACA 


193953 


200810 


AAGCCTTGGACAACG 
CGCGGAACC 


188889 


197310 


AAGCCTTGCACAACG 
CGCGGAACG 


194623 


196610 


AAGCCT TGAGC AAC G 
C GCGGAACT 


192953 


197350 


AAGCCTTGTGCAACG 
C GCGGA ACA 


195968 


197630 


AAGCCTTGGGCAACG 


195177 


193710 



CGCGGAACC 
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Table SI; Continued 



C 



AAGCCT T GCGC AAC G 
CGCGGAACG 



190944 



193010 



AAGCCTTGACCAACG 
^CGCGGAACT 



192663 



194350 



C 



AAGCCTTGTCCAACG 
CGCGGAACA 



193532 



194210 



C 



AAGCCTTGGCCAACG 
CGCGGAACC 



195094 



192390 



AAGCCTTGCCCAACG 
CGCGGAACG 



192864 



190010 



GCAACA-G TTCC AAn 
CCAAGGCCU 



200806 



196010 



GCAACT-T TTCC AAn 
ACAAGGCCCJ 



206650 



192810 • 



c 



AAGC C TTGCAGCAACG 
CGCGGAACG 



202688 



206510 



c 



AAGCCTTGTACCAACG 
CGCGGAACA 



204450 



208010 



.AAGCCTTGCCCAACG 
CGCGGAACG 
A 



191282 



193010 



AAGC CTTGCGC AAC G 
CGCGGAACG 
T 



198343 



193010 
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a Calculated with Equation 3. 
b Calculated with Equation 4. 
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Table S2: Thermodynamic Parameters for Hairpin Oligomer" Association and Oligomer Duplex 
Formation. 



Pigmentary interfaces 1 



AH° 

(kcal / mol) 



AS 0 
(cal / mol-K) 



AG° 3: 
(kcal / mol) 



( C C. 



AAnPPTTriA A P A A P O (A t 




X 


X.J 


-135 1 




8 1 

O. 1 


-6 8 




0 1 


- V. 1 


^CGCGGAACT/TGTTGC 


1 -53.3 


± 


2.7 


-150.1 


X 


8.7 


-6.7 


± 


0.0 


48.4 


a a^ppttht-tpaapo 00 < 






4 3 


-155 8 


X 


14 0 


-7 4 








^CGCGGAACA/ AGTTGC 4 


1 -49.9 


± 


1.7 


-136.9 


X 


5.6 


-7.4 


± 


0.0 


54.2 


A AOPPTTGA - TCAACG (»»> < 


■ -51 9 


X 


4.0 


-143.6 


x 


12.7 


-7 3 




0 1 




GCGGAACT / AGTTGC c 


1 -55.4 


X 


2.1 


-155.0 


± 


4.9 


-7.3 


± 


0.5 


51.7 


r* p a apt . t ft t t r r n a a ^ < 

U C A A L 1 • 1 U 1 1 V t VJ A ^ 


■ -61 6 


± 


2 6 


-173 1 


x 


7 9 


-8 0 


± 


0 1 


53 7 


CGTTGA / ACAAGGCCC^ c 


1 -65.5 


X 


3.1 


-185.2 


X 


9.8 


-8.0 


± 


0.0 


52.8 


AAnPPTTnT.APAAPfr C 


-48 0 




2 3 


-134 4 




7 2 




A 


0 1 


47 ^ 


GCGGAACA / TGTTGC * 


1 -42.2 


X 


1.9 


-115.2 


± 


6.3 


-6.5 


X 


0.1 


49.5 


-A AGC C T T GC - AC AA C G < 


■ -45.9 


x 


1.1 


-127.9 


X 


3.3 


-6.3 


x 


0.1 


47.1 


v-C GCGGAACG / TGTTGC * 


1 -46.3 


± 


4.0 


-129.2 


X 


13.1 


-6.3 


± 


0.1 


46.8 


-AAGC CTTGT - GCAACO « 


■ -54.7 


± 


2.7 


-152.4 


X 


9.3 


-7.4 


± 


0.2 


52.2 


GCGGAACA/ CGTTGC ' 


1 -49.7 


X 


3.4 


-136.3 


X 


11.0 


-7.4 


X 


0.1 


53.9 


r AAGCC TTGG - TCAACG « 


' -53.5 


X 


4.3 


-147.7 


X 


13.6 


-7.7 


X 


0.2 


54.4 


^-C GCGGAACC / AGTTGC ' 


1 -53.8 


X 


3.4 


-148.7 


X 


11.1 


-7.7 


X 


0.1 


54.3 


-AAGCCTTGA- CCAACG « 


» -52.9 


X 


2.2 


-145.4 


X 


7.3 


-7.8 


X 


0.1 


55.6 


GCGGAACT / GGTTGC ' 


1 -49.6 


X 


2.4 


-134.9 


X 


5.5 


•7.8 


X 


0.7 


56.8 


r AAGCCTTGCTCAACG « 


i -48.6 


X 


2.8 


•134.6 


X 


9.3 


-6.9 


± 


0.2 


50.6 


GCGGAACG / AGTTGC * 


1 -43.6 


X 


2.2 


•118.1 


X 


7.3 


-6.9 


± 


0.1 


52.8 


r A AGC C TTGA - GCAACG « 


» -49.7 


X 


3.5 


•136.6 


± 


11.4 


-7.3 


± 


0.1 


53.4 


^C GCGGAACT / CGTTGC ' 


1 -46.6 


X 


1.8 


•128.8 


± 


5.7 


-7.3 


X 


0.0 


49.9 
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Table S2: Continued. 



GCAACA - GGTTCCGAA-v 
CGTTGT /CCAAGGCCC^ 

AAGCCTTGG-ACAACG 
CcGCGGAACC/TGTTGC 



-AAGCCTTGT-CCAACG 
CcGCGGAACA/GGTTGC 

-AAGCCTTGC-GCAACG 
^CGCGGAACG/CGTTGC 

-AAGCCTTGG-CCAACG 
^CGCGGAACC/GGTTGC 

r AAGCCTTGC-CCAACG 
^CGCGGAACG/GGTTGC 

-AAGCCTTGG-GCAACG 
^■CGCGGAACC / CGTTGC 



AH° 
(kcal / mol) 

-53.1 ± 3.8 
-50.8 ± 3.3 

-48.4 ± 3.1 
-52.3 ± 2.0 

-58.0 ± 2.1 
-50.3 ± 2.1 



-47.0 ± 
-48.2 ± 

-57.3 ± 
-46.9 ± 

-55.3 ± 
-51.4 ± 

-45.9 ± 
-52.6 ± 



3.9 
0.9 

5.4 
2.7 

2.6 
1.5 

1.0 
3.5 



AS 

(cal / mol K) 

-150.8 = 13.0 

-143.3 i 10.9 

-133.1 ± 10.4 

-145.9 ± 6.5 

-160.2 ± 6.4 

-135.7 ± 4.7 



-129.0 
-132.8 

-156.4 
-123.9 



-152.7 ± 
-140.2 ± 



-124.4 
-146.0 



12.9 
2.9 

17.0 
5.8 

8.1 
3.4 

3.5 
11.3 



AG 0 -- V 
(kcal -mol) |*C> 



-6.3 = 0.2 

-6.4 = 0.1 

-7.1 = 0.1 

-7.1 s 0.0 



-7.0 
-7.0 

•8.8 
-8.5 

-7.9 
-7.9 

-7.3 
-7.4 



0.1 
0.0 

0.3 
0.9 

0.1 
0.5 

0.1 
0.1 



45.8 
46.9 



50.9 



-8.4 ± 0.2 57.1 
-8.2 ± 0.6 59.5 



51.7 
51.5 

60.1 
63.6 

55.6 
56.7 

54.9 
52.7 



Interfaces with dangling ends' 



-AAGCCTTGC-GCAACG 
^CGCGGAACG/ CGTTGC 



r AAGCCTTGC-GGAACG 
^CGCGGAACG/ CGTTGC 
T 



r AAGCCTTGC-GCAACG 
^CGCGGAACG/ CGTTGC 



-45.2 ± 2.9 
-43.7 ± 3.4 



-123.8 ± 9.4 
•119.1 ± 11.1 



-46.7 ± 3.7 -128.7 ± 11.8 
-46.0 ± 3.3 -126.2 ± 7.5 



-6.8 ± 0.2 50.8 
-6.8 ± 0.1 51.5 



-6.8 ± 0.1 50.9 
-6.8 ± 1.0 51.2 



-48.6 ± 3.0 -133.7 ± 9.4 -7.2 ± 0.1 52.7 
-47.4 ± 3.2 -129.8 ± 10.5 -7.2 ± 0.1 53.1 
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Table S2: Continued. 



c 



AAGCCTTGC-GCAACG 
CGCGGAACG/ CGTTGC 



r AAGCCTTGG-TCAACG 
^CGCGGAACC/ AGTTGC 

A 

-AAGCCTTGC-GCAACG 
^CGCGGAACG /CGTTGC 
A A 

.-AAGCCTTGC-GCAACG 
^CGCGGAACG/ CGTTGC 
A T 



r AAGCCTTGC-GCAACG 
^CGCGGAACG/CGTTGC 
T T 



AAGCCTTGC-GCAACG 
CGCGGAACG /CGTTGC 
T A 



AH 0 
(kcal / mol) 

-46.3 ± 2.6 
-51.6 ± 3.7 



AS 0 
(cal / mol K) 

-127.9 ± 8.7 
-145.4 ± 12.1 



AG° r V 
(kcal /mol) <°C) 



-6.6 = 0.2 
-6.5 = 0.1 



-39.7 ± 
-35.5 ± 



-44.7 ± 
-43.2 ± 



4.6 
2.9 



3.3 
3.7 



-38.5 ± 3.0 
-33.9 ± 1.3 



-43.3 ± 9.0 
-43.9 ± 1.0 



•109.1 ± 
-95.0 ± 



15.2 
5.9 



-126.0 ±11.0 
-121.0 ± 12.2 



-105.1 
-89.6 



10.0 
4.5 



•118.5 ± 30.1 
-120.5 ± 3.4 



-5.8 ± 0.2 
-6.0 ± 1.1 



-5.6 ± 0.2 
-5.7 ± 0.2 



-6.0 ± 0.2 
-6.1 ± 0.1 



-6.5 ± 0.3 
-6.5 ± 0.0 



43.6 
42.3 



-49.2 ± 3.5 -136.6 ± 11.0 -6.9 ± 0.2 44.8 
-45.1 ± 2.9 -123.1 ± 9.4 -6.9 ± 0.1 45.7 



38.2 
39.8 



36.6 
37.2 



39.3 
41.0 



43.1 
43.2 



Interfaces with extra central nucleotide 



r AAGCCTTGCAGCAACG 
^CGCGGAACG /CGTTGC 



« -43.8 ± 9.4 
d -45.0 ± 3.5 



-120.2 ± 31.0 
•124.1 ±11.5 



-6.5 ± 0.2 43.0 
-6.5 ± 0.1 42.7 



AAGCCTTGTACCAACG « -45.0 ± 2.8 



C GCGGAACA /GGTTGC 



-45.0 ± 4.0 



-124.5 ± 9.1 
-124.8 ± 13.4 



•6.3 ± 0.2 41.6 
-6.3 ± 0.2 41.4 
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Table S2: Continued. 



Pli comers 



AH 0 AS 0 AG° 37 T M J 

(kcal/mol) (cal/molK) (kcal.'mol) fC) 



T C A A C G 


« -40.6 


± 


.2-5 


-115.4 


± 


9.1 


-4.8 




0.3 


30.5 


AGT T G C 


d -36.5 


± 


1.9 


-101.2 


± 


6.3 


-5.1 


± 


O.l 


31.9 


.ACAACG- 


« -38.3 




2.0 


•106.8 


± 


6.5 


-5.2 




0.2 


33.4 


TGTTGC 


d -33.9 




1.6 


-91.8 




5.3 


-5.4 


± 


0.1 


34.6 1 


GC A A C G 


c -43.9 


± 


2.4 


•121.6 


± 


7.4 


-6.2 


± 


0.1 


40.7 


CGTTGC 


d -41.1 


± 


2.1 


-112.6 


± 


6.9 


-6.2 


± 


0.1 


41.2 


CCAACG 


c -41.0 


± 


2.6 


-114.1 


± 


8.9 


-5.6 


± 


0.2 


36.4 


GGT T GC 


d -36.2 


± 


0.9 


-98.2 


± 


1.8 


-5.8 


± 


0.3 


37.8 


TGTTGC 


c -38.8 


± 


3.8 


-106.6 


± 


I2.l 


-5.7 


± 


O.l 


37.5 


ACAACG 


d -35.5 




2.8 


-95.7 


± 


9.2 


-5.8 




O.l 


38.4 


AGT T G C 


« -37.1 


± 


2.9 


-101.2 


± 


9.4 


-5.7 


± 


O.l 


36.9 


TCAACG 


d -36.9 


± 


2.4 


-100.8 


± 


8.0 


-5.7 


± 


O.l 


36.9 



1 T M calculated for 4x10"* total strand concentration. 

b The top strand of each system is conventionally represented in the S 1 to 3' orientation. Nucleotides 
involved in coaxial stacking interfaces are represented in bold. 

c Parameters obtained from averaging fits of melting curves. Reported errors are standard deviations 
in the precision of the fitted data. 

* Parameters obtained from T M vs. ln(CV4) plots. Reported errors are standard deviations in the 
precision propagated from the slope and intercept of the 1/T M vs. In (CV4) plot, 
(i). <»i). 0U) iyTw vs i^c^j p i ots for these systems are shown in Figure SI . 
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WHAT IS CLAIMED IS: 

1 LA method for predicting nucleic acid hybridization 

2 thermodynamics, the method comprising: 

3 providing a database of thermodynamics parameters; 

4 receiving hybridization information which represents at least one 

5 sequence; 

6 receiving correction data; 

7 receiving a first set of data which represents hybridization conditions; 

8 and 

9 calculating hybridization thermodynamics including net hybridization 

10 thermodynamics based on the hybridization information, the thermodynamic 

11 parameters, the correction data and the first set of data. 

1 2. The method as claimed in claim 1 wherein the hybridization 

2 thermodynamics of individual single stranded, bimolecular and higher order 

3 complexes are statistically weighted in a numerical process and the equilibrium 

4 concentration of each species is output. 

1 3 . The method as claimed in claim 2 wherein the correction data 

2 includes folding correction data. 

1 4. The method as claimed in claim 2 wherein the correction data 

2 includes linear correction data. 

1 5 . The method as claimed in claim 1 wherein the thermodynamic 

2 parameters include DNA thermodynamic parameters. 

1 6. The method as claimed in claim 5 wherein the DNA 

2 thermodynamic parameters include dangling end parameters. 

1 7. The method as claimed in claim 5 wherein the DNA 

2 thermodynamic parameters include coaxial stacking parameters. 
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1 8. The method as claimed in claim 5 wherein the DNA 

2 thermodynamic parameters include terminal mismatch parameters. 

1 9. The method as claimed in claim 1 wherein the thermodynamic 

2 parameters include RNA thermodynamic parameters. 

1 10. The method as claimed in claim 1 wherein the thermodynamic 

2 parameters include hybrid DNA/RNA thermodynamic parameters. 

1 11. The method as claimed in claim 1 wherein the thermodynamic 

2 parameters include DNA loop thermodynamic parameters. 

1 12. The method as claimed in claim 1 wherein the hybridization 

2 information represents top and bottom strand sequences which form a duplex and 

3 wherein the hybridization thermodynamics are calculated for the duplex. 

1 13. The method as claimed in claim 1 wherein the hybridization 

2 information represents at least a section of a target and a length of at least one 

3 primer or probe complimentary to the target. 

1 14. The method as claimed in claim 13 wherein the hybridization 

2 thermodynamics are calculated for a plurality of primers or probes complimentary 

3 to the target. 

1 15. The method as claimed in claim 1 wherein the hybridization 

2 information represents at least a section of a target and a primer or probe. 

1 16. The method as claimed in claim 15 wherein a length of the 

2 target is longer than a length of the primer or probe and wherein the hybridization 

3 thermodynamics are calculated for a best target/primer or target/probe complex and 

4 for competitive mismatch complexes. 
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1 17. The method as claimed in claim 14 wherein hybridization 

2 information represents at least a section of a target and a primer or probe and 

3 wherein a length of a target is longer than the length of the primer or probe and 

4 wherein the hybridization thermodynamics are calculated for a best target/primer 

5 or target/probe complex and for competitive target/primer or target/probe 

6 complexes. 

1 18. The method as claimed in claim 2 further comprising, 

2 calculating concentration of each species in a solution at a plurality of temperatures. 

1 19. The method as claimed in claim 18 wherein hybridization 

2 information also represents a primer or probe and wherein the length of the target 

3 is longer than a length of the primer or probe and wherein the hybridization 

4 thermodynamics are calculated for a best target/primer or target/probe complex and 

5 for competitive mismatch complexes and wherein the method further comprises 

6 calculating concentration of every species in a solution at a plurality of 

7 temperatures. 

1 20. The method as claimed in claim 19 wherein the hybridization 

2 thermodynamics are calculated for at least two best target/primer or target/probe 

3 complexes and for their corresponding competitive mismatch complexes and 

4 wherein the method further comprises correcting for any interactions between the 

5 at least two best target/primer or target/probe complexes and their components. 

1 21. A system for predicting nucleic acid hybridization 

2 thermodynamics, the system comprising: 

3 a database of thermodynamics parameters; 

4 means for receiving hybridization information which represents at 

5 least one sequence; 

6 means for receiving correction data; 

7 receiving a first set of data which represents hybridization conditions; 

8 and 
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9 means for calculating hybridization thermodynamics including net 

10 hybridization thermodynamics based on the hybridization information, the 

1 1 thermodynamic parameters, the correction data and the first set of data. 

1 22. The system as claimed in claim 21 wherein the hybridization 

2 thermodynamics of individual single stranded, bimolecular and higher order 

3 complexes are statistically weighted in a numerical process and the equilibrium 

4 concentration of each species is output. 

1 23 . The system as claimed in claim 22 wherein the correction data 

2 includes folding correction data. 

1 24 . The system as claimed in claim 22 wherein the correction data 

2 includes linear correction data. 

1 25. The system as claimed in claim 21 wherein the 

2 thermodynamic parameters include DNA thermodynamic parameters. 

1 26. The system as claimed in claim 25 wherein the DNA 

2 thermodynamic parameters include dangling end parameters. 

1 27. The system as claimed in claim 25 wherein the DNA 

2 thermodynamic parameters include coaxial stacking parameters. 

1 28. The system as claimed in claim 25 wherein the DNA 

2 thermodynamic parameters include terminal mismatch parameters. 

1 29. The system as claimed in claim 21 wherein the 

2 thermodynamic parameters include RNA thermodynamic parameters. 

1 30. The system as claimed in claim 21 wherein the 

2 thermodynamic parameters include hybrid. DNA/RNA thermodynamic parameters . 
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1 31. The system as claimed in claim 21 wherein the 

2 thermodynamic parameters include DNA loop thermodynamic parameters. 

1 32. The system as claimed in claim 21 wherein the hybridization 

2 information represents top and bottom strand sequences which form a duplex and 

3 wherein the hybridization thermodynamics are calculated for the duplex. 

1 33. The system as claimed in claim 21 wherein the hybridization 

2 information represents at least a section of a target and a length of at least one 

3 primer or probe complimentary to the target. 

1 34. The system as claimed in claim 33 wherein the hybridization 

2 thermodynamics are calculated for a plurality of primers or probes complimentary 

3 to the target. 

1 35. The system as claimed in claim 21 wherein the hybridization 

2 information represents at least a section of a target and a primer or probe. 

1 36. The system as claimed in claim 35 wherein a length of the 

2 target is longer than a length of the primer or probe and wherein the hybridization 

3 thermodynamics are calculated for a best target/primer or target/probe complex and 

4 for competitive mismatch complexes. 

1 37. The system as claimed in claim 34 wherein hybridization 

2 information represents at least a section of a target and a primer or probe and 

3 wherein a length of a target is longer than the length of the primer or probe and 

4 wherein the hybridization thermodynamics are calculated for a best target/primer 

5 or target/probe complex and for competitive target/primer or target/probe 

6 complexes. 

1 38 . The system as claimed in claim 22 further comprising means 

2 for calculating concentration of each species in a solution at a plurality of 

3 temperatures. 
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1 39. The system as claimed in claim 38 wherein hybridization 

2 information also represents a primer or probe and wherein the length of the target 

3 is longer than a length of the primer or probe and wherein the hybridization 

4 thermodynamics are calculated for a best target/primer or target/probe complex and 

5 for competitive mismatch complexes and wherein the system further comprises 

6 means for calculating concentration of every species in a solution at a plurality of 

7 temperatures. 

1 40. The system as claimed in claim 39 wherein the hybridization 

2 thermodynamics are calculated for at least two best target/primer or target/probe 

3 complexes and for their corresponding competitive mismatch complexes and 

4 wherein the system further comprises means for correcting for any interactions 

5 between the at least two best target/primer or target/probe complexes and their 

6 components. 

1 41. A computer-readable storage medium having stored therein 

2 a database of thermodynamics parameters and a computer program which executes 

3 the steps of: 

4 receiving hybridization information which represents at least one 

5 sequence; 

6 receiving correction data; 

7 receiving a first set of data which represents hybridization conditions; 

8 and 

9 calculating hybridization thermodynamics including net hybridization 

10 thermodynamics based on the hybridization information, the thermodynamic 

1 1 parameters, the correction data and the first set of data. 

1 42. The storage medium as claimed in claim 41 wherein the 

2 hybridization thermodynamics of individual single stranded, bimolecular and higher 

3 order complexes are statistically weighted in a numerical process and the 

4 equilibrium concentration of each species is output. 
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1 43. The storage medium as claimed in claim 42 wherein the 

2 correction data includes folding correction data. 

1 44. The storage medium as claimed in claim 42 wherein the 

2 correction data includes linear correction data. 

1 45. The storage medium as claimed in claim 41 wherein the 

2 thermodynamic parameters include DNA thermodynamic parameters. 

1 46 . The storage medium as claimed in claim 45 wherein the DNA 

2 thermodynamic parameters include dangling end parameters. 

1 47 . The storage medium as claimed in claim 45 wherein the DNA 

2 thermodynamic parameters include coaxial stacking parameters. 

1 48. The storage medium as claimed in claim 41 wherein the DNA 

2 thermodynamic parameters include terminal mismatch parameters. 

1 49. The storage medium as claimed in claim 41 wherein the 

2 thermodynamic parameters include RNA thermodynamic parameters. 

1 50. The storage medium as claimed in claim 41 wherein the 

2 thermodynamic parameters include hybrid DNA/RNA thermodynamic parameters. 

1 51. The storage medium as claimed in claim 41 wherein the 

2 thermodynamic parameters include DNA loop thermodynamic parameters. 

1 52. The storage medium as claimed in claim 41 wherein the 

2 hybridization information represents top and bottom strand sequences which form 

3 a duplex and wherein the hybridization thermodynamics are calculated for the 

4 duplex. 
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1 53. The storage medium as claimed in claim 41 wherein the 

2 hybridization information represents at least a section of a target and a length of at 

3 least one primer or probe complimentary to the target. 

1 54. The storage medium as claimed in claim 53 wherein the 

2 hybridization thermodynamics are calculated for a plurality of primers or probes 

3 complimentary to the target. 

1 55. The storage medium as claimed in claim 41 wherein the 

2 hybridization information represents at least a section of a target and a primer or 

3 probe. 

1 56. The storage medium as claimed in claim 55 wherein a length 

2 of the target is longer than a length of the primer or probe and wherein the 

3 hybridization thermodynamics are calculated for a best target/primer or target/probe 

4 complex and for competitive mismatch complexes. 

1 57. The storage medium as claimed in claim 54 wherein 

2 hybridization information represents at least a section of a target and a primer or 

3 probe and wherein a length of a target is longer than the length of the primer or 

4 probe and wherein the hybridization thermodynamics are calculated for a best 

5 target/primer or target/probe complex and for competitive target/primer or 

6 target/probe complexes. 

1 58. The storage medium as claimed in claim 42 wherein the 

2 program further executes the step of calculating concentration of each species in a 

3 solution at a plurality of temperatures. 

1 59. The storage medium as claimed in claim 58 wherein 

2 hybridization information also represents a primer or probe and wherein the length 

3 of the target is longer than a length of the primer or probe and wherein the 

4 hybridization thermodynamics are calculated for a best target/primer or target/probe 

5 complex and for competitive mismatch complexes and wherein the program 
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6 executes the step of calculating concentration of every species in a solution at a 

7 plurality of temperatures. 

1 60. The storage medium as claimed in claim 59 wherein the 

2 hybridization thermodynamics are calculated for at least two best target/primer or 

3 target/probe complexes and for their corresponding competitive mismatch 

4 complexes and wherein the program executes the step of correcting for any 

5 interactions between the at least two best target/primer or target/probe complexes 

6 and their components. 
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Hybridization Information 

Top Strand/Bottom Strand Types 
DNA/DNA p] 

Top Strand Sequence 5'-3' 

cccaaaaaaaaaaaccg 



Module 1 



Bottom Strand Sequence 5'-3' p] 
[Tj | Use Complement 

♦ggtttttttttttgg* 



B 



JB 



J3 



Hybridization Conditions 

I User defined values for [Na-f] and [Mg2+] m 
[Monovalent cation] ' Q . ios ( 



[Mg 2+ ] 

Hybridization 
Temperature 

[Top Strand ] 
[Bottom Strand] 



'*37'Tc| 



5e-8 



3e-7 



mol/L 
mol/L 

°c 

I mol/L 
I mol/L 



Corrections 

Linear Correction for Micro Chips 
(AG 0 37 (microchip) = a x AG° 37 (solution) + b) 

a= : 1 Zl b = _° I 

Top Strand Folding Correction 

AG °37= 3£i-J kcal/mol AH°= -37.8 | kcal/m 

Bottom Strand Folding Correction 

J kcal/mol AH°= j kcal/m 



AG° 37 = V 



f Predict Thermodynamics j Clear Input 



Figure 2a 
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Duplex sequence 

5 / -CCCAAAAAAAAAAACCG-3 / 
3 / -*GGTTTTTTTTTTTGG*-5 / 

Experimental conditions 



Module 1 



Thermodynamic predictions 



Hybridization type = DNA/DNA 
[Top strand] = 0.5E-07 mol/L 
[Bottom strand] = 0.3E-06 mol/L 

Hybridization temperature = 37.0 °C 



Corrections 



In 1.000 MNaCl: 

AH°=-119.3kcal/mol 
AS° = -335.8 eu 
AG° 370 = -15.14 kcal/mol 

T M = 52.9°C 



Top strand folding: 
AH° = -37.8 kcal/mol 
AG 0 37 0 = -2.10 kcal/mol 



In 0.1050 M NaCl and 0.0000 M 
MgC12: 

A H° = -119.3 kcal/mol 

AS° = -348.3eu 

AG 0 37 0 = -11.29 kcal/mol 

T M = 42.2°C 

The net hybridization thermodynamics 
is: 

A G ° yj Q = -8.74 kcal/mol 
T M = 34.9°C 



Note: 



The net hybridization temperature is the temperature at which the concentration of duplex equals half the 
maximum possible concentration of duplex. 

The net free energy is calculated from the net equilibrium constant at the given temperature: 
Knet=Puplex]/((Ct-[Duplex])*(Cb-p U plex])), where [Duplex] is the concentration of duplex, Ct is the 
initial concentration of top strand, Cb is the initial concentration of bottom strand. 



Figure 2b 
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Hybridization Information Module 2 

Target/Primer Types 
DNA/PNA p] 

Target Sequence 5'-3' 

B 

Primer length 1 
Number of best primers to be displayed J* j 

; B 



Hybridization Conditions 



Corrections 



; User defined values for [Na+] and [Mg2+] p| 

[Monovalent cation] ii 1 mol/L 

[Mg 2+ ] } o" I mol/L 

Hybridization frf'S o r 

Temperature ' * C 

[Target] ^ r ^— • — j 



[Primer] 



; le-6. 



"""" | mol/L 



Linear Correction for Micro Chips 
(AG 0 37 (microchip) = a X AG° 37 (solution) + b) 



a = 



| prgdkt primers . j Clear Input 



Figure 3a 
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Module 2 



Target sequence 



5'-ACCGTTTGTA GTCCGT ACG A CACATAACGG TGCATTC 



Experimental conditions Corrections 



Hybridization type = DNA/DNA No corrections 

[Top strand] = 0.1E-05 mol/L 
[Bottom strand] = 0.1E-05 mol/L 

Hybridization temperature = 37.0 °C 

[Na + ] = 1.0000 mol/L 

[Mg 2 *] = 0.0000 mol/L 

The 2 best primers of length 15 are: 



From position 28 to 42: 5'-GGTTGCAATGCACCG -3' 

AH° = -132.0kcal/mol AS 0 = -355.6 eu AG° 37 0 = -21.71 kcal/mol T M = 
70.2 °C 

From position 35 to 49: 5'-GCAGCATG GTTGCAA -3 ; 

A H° = -124.8 kcal/mol AS 0 = -336.5 eu AG 0 37 0 = -20.42 kcal/mol T M = 
68.4 °C 

Figure 3b 
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Module 3 



Hybridization Information 

Target/Primer type 
DNA/DNA p} • 1 

Target 5'-3' Primer Sequence •„ g-3 1 

• acgcttgaa'tgcagttaatgcc [T] tgaatgcagt "" 



Minimum percent stability of alternative binding sites 
compared to the most stable binding site 

Number of base pairs required to compute the solution jj[ZL! f 



-J3 ; : El 



Hybridization Conditions 

• User defined values for [Na+] and [Mg2+] |7 

[Monovalent cation] [ i^ZIJ mol/L 

[Mg 2+ ] £o J mol/L 
Hybridization 

Temperature ^ u 

[Target] -je-s" ""' v ' j mol/L 

[Primer] 'ie-6" ' | mol/L 



Corrections 

Linear Correction for Micro Chips 
(AG 0 37 (microchip) = a X AG° 37 (solution) + b) 

a= i j b= ' o , | 



j Submit j Clear Input 



Figure 4a 
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Module 3 

Target sequence 



S'-ACGCTTGAAT GCAGTT AATG CC-3 7 

Primer sequence 



3 -TGACGTAAGT-5 

Experimental conditions Corrections 



Hybridization type = DNA/DNA No corrections 

[Top strand] = 0.1E-05 mol/L 
[Bottom strand] = 0.1E-05 mol/L 
Hybridization temperature = 37.0 °C 
[Na + ] = 1.0000 mol/L 

[Mg 2+ ] = 0.0000 mol/L 

Number of base pairs required to compute the 

solution = 5 

Best primer site 



from target position 8 to position 17 

GAATGCAGTTAA 
TGACGTAAGT 



AH° = -26.2 kcal/mol AS 0 = -70.7 eu AG 0 37 0 = -4.28 kcal/mol 
T M = -9.9 °C 

Figure 4b 
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Module 5 



Hybridization Information 

Target/Primer Types 
DNA/DNA frj 

Target Sequence 5'-3' 



B 



Find best primer in sequence section ranging from 
nucleotide number: 



__J to 
Primer length 15 j 

Number of best primer l ( 



10 



Percent stability of alternative binding sites compared to the most stable binding site so 
Number of base pairs required to compute the solution 7 



J 



Hybridization Conditions 

■ User defined values for [Na+] and [Mg2+] [rj 
[Monovalent cation] fi j 



[Mg 2+ ] 

Hybridization 
Temperature 

[Target] 
[Primer] 



2ZJ 

I 3 7 1 p| 



le-6 



mol/L 
mol/L 

°C 
J mol/L 



'ie-s' j mol/L 



Corrections 

Linear Correction for Micro Chips 
(AG° 37 (microchip) = a X AG° 37 (solution) + b) 

J b= o | 



a = 



■. l 



[ Submit \ \ Clear Input 



Figure 5a 
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Target sequence 

. Module 5 



5'- 

AGGTCCATGCTTTGGAACAGCTACTTGAACCGATCATGGACACTGACGGATAAC( 
-3' 

Experimental conditions Corrections 



Hybridization type = DNA/DNA 

[Top strand] = 0.1E-05 moI/L No corrections 

[Bottom strand] = 0.1E-05 mol/L 

Hybridization temperature = 37.0 °C 

[Na+] = 1.0000 mol/L 

[Mg 24 -] = 0.0000 mol/L 

Number of base pairs required to compute the 

solution = 7 

Best primer search area from position 1 to 
position 60 



Best primer # 1: 



from target position 35 to 49 
5 1 -TCATGGACACTGACGGA- 3 1 
3 1 - GTACCTGTGACTGCC - 5 1 

AH° = -123.1 kcal/mol AS° = -331.5 eu AG° 37 0 = -20.27 kcal/mol T M = 
68.4 °C 



Best primer #2: 



from target position 18 to 32 
5 ; - ACAGCTACTTGAACCGA- 3 ' 
3 1 -GTCGATGAACTTGGC-5 1 

•AH° = - 125.0 kcal/mol AS° = -339.6 en AG 0 
66.1 °C 



37 0 = -19.67 kcal/mol T M ; 

Figure 5b 
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initial guess: 



C = C 



Total 



_ n Total 



' 2, C ss t — c t 
temperature T=0 °C 




Multiplex PCR Design 



Pi 



> Gene I 



P3 



> Gene II 



Target DNA 

3X3 



m 



P2 



P4 



Figure 7 
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Prediction of Molecular Beacon 
Hybridization 



Duplex Formation 

CCCAAAAAXAAAAACCG 
+ 

GGTTTTTYTTTTTGG 



X-Y 



*G° 37 T M 



CCCAAAAAXAAAAACCG 
GGTTTTTYTTTTTGG 



A-T -13.24 47.9 

A-A -9.94 39.1 

A-C -9.03 36.0 

A-G -10.36 40.4 



Beacon Folding 



Random Coil Beacon 



Net Hybridization 

Hairpin Beacon 
+ 

Target 



A X A 

A A A A 
A ■ A 
A A 
A A 

C C C C 
C-G 

T-A 

C-G 

G-C 

C-G 

•) 

Hairpin Beacon 



■> — Q-CGTCCCCAAAAAAAAAAACCGACG-^ 

- / l\ 



AG° J7 = -2. 1 kcal/mol T M = 55.2 °C 



3 ' -GGTTTTTXTTTTTGG- 5 ' 



Target 



X-Y 


AG« 37 


(Effective) 


T M (Effective) 




Exp. 


Pred. 


ExjL Fred, 


A-T 


-10.49 


-10.69 


42 42.4 


A-A 


-6.66 


-7.39 


27 26.8 


A-C 


-6.72 


-6.48 


23 21.1 


A-G 


-7.62 


-7.81 


28 29.5 



3'GGI I I I I 1 1 I I I IGG 5 ' 
3 GGTTTTTA I I I I IGG 5 ' 
s'GGTTTTTCI I I I IGG 5 ' 
3 'GGTTTTTGTTTTTGG 5 ' 



0.105 MNaCl 0.001 MMgCl 2 [beacon] = 5x Iff 8 M [target] = 3xl0" 7 M 
Bonnet et al. (1999), Proc. Nat. Acad. Sci. USA 96, 6171-6176 



Figure 8 
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Simulation of Molecular Beacon Hybridization 



1.2 



c 
o 

« 0.8 

S 

a 
o 

o- 0.6 

c 

o 

I 0-4 



0.2 



20 











W— 


♦ F(RC) 
■ F(Dup) 
a F(Hairpin) 
x F(target bound; 











40 60 80 
Temperature 



100 120 



Figure 9 
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Match vs. Mismatch Hybridization 

Match Site Mismatch Site 
1 I I I I 1— i . — I Target DNA 

Probe Probe MM 

Equilibria 

Match Hybrid T + P + 
Mismatch Hybrid T + P^: 
Double Hybrid T+2P ^ 

• Given C Xarget [total], C Probe [total]and the 3 equilibrium constants above, 
it is trivial to solve for the concentrations of all species 

• Since AG° 37 and AH 0 are known, calculate K's at all temperatures 

• Simulate hybridization at all temperatures - optimize specificity 

• More complex model would also include single-strand folding equilibria 

FigurelO 



± T-P 

"p.pMM 

+■ T_p-P MM 



[T-P] 
K M - [T][P] 

[T-pMMj 
K MM = m [p j 



K DH- 



[T-P-P MM ] 

m [pj 2 
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Match vs. Mismatch Hybridization Simulation 



c 
o 

1 

o 
c 
o 
O 

T3 

8 

■— 
CO 

o 
z 



1.00 
0.90 
0.80 
0.70 
0.60 
0.50 
0.40 
0.30 
0.20 
0.10 
0.00 



4 




♦ Fraction Target in R.C. 
■ Fraction Probe in R.C. 
A Fraction Match Hybrid 
O Fraction Mismatch Hybrid 
A Fraction Double bound 



20 
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100 



Temperature (°C) 



Figurell 
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